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DNA DEMETHYLASE, THERAPEUTIC AND 
DIAGNOSTIC USES THEREOF 

BACKGROUND OF THE INVENTION 

5 (a) Field of the Invention 

The invention relates to a novel enzyme, DNA 
demethylase, therapeutic and diagnostic uses thereof, 
(b) Description of Prior Art 

Many lines of evidence have established that 

10 modification of cytosine moieties residing in the dinu- 
cleotide sequence CpG in vertebrate genomes is involved 
in regulating a number of genome functions such as 
parental imprinting, X~ inact ivat ion, suppression of 
methylation of ectopic genes and differential gene 

15 expression (Szyf, M. (1996) Pharmacol. Ther . 70, 1-37). 
DNA methylation performs its function of differentially 
marking genes because the distribution of methylated 
CpGs is tissue- and site- specific forming a pattern of 
methylation (Szyf/ M. (1996X Pharmacol. Ther. 70, 1- 

20 37) . It is clear that the pattern of methylation is 
fashioned by a sequence of methylation and demethyla- 
tion events (Brandeis, M. et al . (1993) Bioassays 15, 
709-713) during development and is maintained in the 
fully differentiated cell (Razin, A. et al . (1980) Sci- 

25 ence 210, 604-610) . While it was originally suggested 
that DNA demethylation is accomplished by a passive 
loss of methyl groups during replication (Razin, A. et 
al. (1980) Science 210, 604-610), it is now clear that 
an active process of demethylation occurs in embryonal 

30 cells (Frank, D. et al . (1991) Nature 351, 239-241), in 
differentiating cell lines (Razin, A. et al . (1986) 
Proc. Natl. Acad. Sci . USA 83, 2827-2831; Szyf, M. et 
al. (1985) Proc. Natl. Acad. Sci. USA 82, 8090-8094) 
and in response to estrogen treatment (Saluz, H.P. et 

35 al. (1986) Proc. Natl. Acad. Sci. USA 83, 7167-7171). 
Two modes of demethylation have been documented: site 
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specific demethylation that coincides in many instances 
with onset of gene expression of specific genes and a 
general genome wide demethylation that occurs during 
early development in vivo during cellular differentia- 
tion and in cancer cells (Feinberg, A. P. et al . (1983) 
Nature 301, 89-92; Razin, A. et al . (1986) Proc . Natl. 
Acad. Sci. USA 83, 2827-2831). The global demethyla- 
tion is consistent with the hypothesis that a general 
demethylase activity which is activated at specific 
points in development or oncogenesis exists. It has 
been hypothesized that one mechanism regulating the 
pattern of methylation is the control of expression of 
methyltransferase (Szyf, M. (1991) Biochem. Cell Biol. 
69. 764-161) and demethylase activities (Szyf, M. (1994) 
Trends Pharmacol. Sci. 7. 233-238). Although exten- 
sive information has been obtained on the enzymatic 
activity responsible for methylation and the regulation 
of its expression in the last two decades (Szyf, M. 
(1996) Pharmacol. Ther. 70, 1-37), the identity of the 
demethylase has remained a mystery. It is clear how- 
ever that to fully understand how patterns of methyla- 
tion are formed and maintained and to determine their 
role in development, physiology and oncogenesis, one 
has to identify the demethylase enzyme (s). Two main 
difficulties have inhibited the identification of this 
enzyme. First, it is believed that demethylation of a 
methylated cytosine is chemically highly unlikely since 
it involves breaking a very stable C-C bond. Second, 
demethylation occurs at very defined stages in develop- 
ment (Brandeis, M. et al . (1993) Bioassays 15, 709-713) 
and identifying an adequate tissue source for this 

enzyme is critical . 

Whereas no bona fide demethylase has been iden- 
tified to date, alternative biochemical mechanisms 
35 involving exchange of methylated cytosines with non- 
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methylated cytosines have been described. One previ- 
ously proposed mechanism is removal of the methylated 
base by a glycosylase and its replacement with a non- 
methylated nucleotide utilizing an "excision-repair" 
5 mechanism (Razin, A. et al . (1986) Proc . Natl. Acad. 
Sci. USA 83, 2827-2831). Glycosylase activities that 
can remove methylated cytosines from DNA have been dem- 
onstrated by Vairapandi and Duker ' (Vairapandi , M. et 
al. (1993) Nucl. Acids Res. 21. 5323-5327) and more 

10 recently by Jest (Jost, J. P. et al . (1995) J. Biol. 
Chem. 270, 9734-9739) . However it is not clear 

whether this activity is responsible for the general 
demethylation observed in cellular differentiation. 
The fact that the activity identified by Jost acts spe- 

15 cifically on hemimethylated sequences (which is not the 
natural substrate in most cases) and can remove thymi- 
dines as well as 5-methylcytosines , supports a repair 
function for this glycosylase-demethylase (Jost, J. P. 
et al. (1995) J. Biol. Chem. 270, 9734-9739). An 

20 alternative mechanism involving a RNA dependent activ- 
ity has been recently described by Weiss et al . (Wexss 
et al . , 1996). This proteinase - insensitive RNA depend- 
ent activity has been shown to catalyze the excision 
and replacement of a methylated CpG dinucleotide with a 

25 nonmethylated CpG dinucleotide that is contained in 

DNA-RNA hybrid molecule (Weiss, A. et al . (1996) Cell 
87, 709-718) . This activity which was identified in 
differentiating cells in culture was proposed to be 
involved in demethylation during development . These 

30 previous findings demonstrate that the common accepted 
model in the filed has been that a bona fide demethy- 
lase does not exist. 

It has been previously proposed that the exten- 
sive hypomethylation observed in cancer cells might be 

35 a consequence of activation of demethylase activity by 



a 
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oncogenic pathways (Szyf, M.(1994) Trends Pharmacol. 
Sci. 7, 233-238; Szyf, M. et al . (1995) J. Biol. Chetn. 
270, 12690-12696) . In accordance with this hypothesis 
we have shown that ectopic expression of v-Ha-ras had 
induced demethylation activity in the cells (Szyf, M. 
et al. (1995) J. Biol. Chem. 270, 12690-12696). Using 
an assay that directly measures the conversion of 3'"P 
labeled methyl dCMP (mdCMP) into dCMP, we have shown 
that nuclear extracts prepared from P19-Ras transfec- 
tants bear high levels of demethylase activity (Szyf, 
M. et al. (1995) J. Biol. Chem. 270, 12690-12696). 
Building on this observation, we hypothesized that can- 
cer cell lines were a good source for demethylase. 
However, it is not evident that Ras expression in pl9 
cells does reflect the situation in cancer cells. P19 
is an embryonic cell and expression of Ras might be 
differentiating them. 

■It would be highly desirable to be provided with 
a bona fide DNA demethylase (DNA dMTase) to alter 
20 developmental programs for therapeutic and biological 
use . 
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SUMMARY OF THE INVENTION 

In accordance with the present invention, we 
demonstrate the purification of a bona fide DNA demeth- 
ylase (DNA dMTase) from a human lung cancer cell line 
A549, determine its kinetic parameters and substrate 
specificity. The DNA dMTase activity identified in 
this study converts methyl -dCMP (mdCMP) residing in the 
dinucleotide -sequence mdCpG into dCMP whereas the 
methyl group is released as a volatile residue which 
was identified to be methanol. The activity is puri- 
fied away from any trace amounts of dCTP, is insensi- 
tive to the DNA polymerase inhibitor ddCTP, is not 
affected by the presence of methyl dCTP (mdCTP) in the 
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reaction and does not exhibit exonuclease or glyco- 
sylase activities. The identification of this new 
enzyme points out to new directions in our understand- 
ing of how DNA methylation patterns are formed and 
5 altered. 

One aim of the present invention is to provide a 
Jbona fide DNA demethylase (DNA dMTase) . 

In accordance with the present invention there 
is provided a DNA demethylase enzyme having about 
10 4 0 KDa, and wherein the DNA demethylase enzyme is over- 
expressed in cancer cells and not in normal cells. 

In accordance with the present invention there 
is provided a cDNA encoding human demethylase which 
comprises a sequence set forth in SEQ ID N0:1. 
15 In accordance with the present invention there 

is provided two mouse cDNAs homologous to the human 
cDNA, wherein the cDNA encoding mouse demethylase hav- 
ing a sequence set forth in SEQ ID NOS:5-7. 

In accordance with the present invention there 
2 0 is provided a different human cDNA which encodes a pro- 
tein homologous to the human demethylase having a 
sequence set forth in SEQ ID NO: 3. 

In accordance with the present invention there 
is provided the use of the expression of demethylase 
25 cDNAs to alter DNA methylation patterns of DNA in vitro 
in cells or in vivo in humans, animals and in plants. 

The demethylase cDNAs expression may be under 
the direction of mammalian promoters, such as CMV. 

The demethylase cDNAs expression may be under 
30 plant specific, promoters to alter methylation in plants 
and to allow for altering states of development of 
plants and expression of foreign genes in plants. 

The demethylase cDNAs expression may be in the 
antisense orientation to inhibit demethylase in cancer 
35 cells for therapeutic processes. 
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The expression of demethylase cDNA in mammalian 
cells may be to alter their differentiation state and 
to generate stem cells for therapeutics, cells for ani- 
mal cloning and to improve expression of foreign genes. 
5 in accordance with the present invention there 

is provided the use of the expression of demethylase 
cDNAs in bacterial or insect cells for production of 
large amounts of demethylase. 

In accordance with the present invention there 
10 is provided the use of the expression of demethylase 
cDNAs for the production of protein in vertebrate, 
insect or bacterial or plant cells, such as antibodies 
against demethylase . 

In accordance with the present invention there 
15 is provided the use of the sequence of demethylase 
cDNAs as a template to design antisense oligonucleo- 
tides and ribozymes. 

In accordance with the present invention there 
is provided the use of the predicted peptide sequence 
20 of demethylase cDNAs to produce polyclonal or mono- 
clonal antibodies against demethylase. 

In accordance with the present invention there 
is provided the use of expression of cDNAs in two 
hybrid systems in yeast to identify proteins interact- 
25 ing with demethylase for diagnostic and therapeutic 
purposes . 

in accordance with the present invention there 
is provided the use of expression of cDNAs in bacte- 
rial, vertebrate or insect cells to produce large 
3 0 amounts of demethylase for obtaining a x-ray crystal 
structure and for high throughput screening of demethy- 
lase inhibitors for therapeutics and biotechnology. 

In accordance with the present invention there 
is provided a volatile assay for high throughput 
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screening of demethylase inhibitors as therapeutics and 
anticancer agents which comprises the steps of: 

a) using transcribed and translated demethylase 
cDNAs in vitro to convert methyl -cytosine pres- 
ent in methylated DNA samples to cytosine pres- 
ent in DNA and volatilize methyl group; 

b) determining the absence or minute amount of 
volatilize methyl group as 'an indication of an 
active demethylase inhibitor. 

In accordance with the present invention there 
is provided a volatile assay for the diagnostics of 
cancer in a patient sample which comprises the steps 
of: 

a) determining demethylase activity in patient sam- 
15 pies by assaying conversion of methyl -cytosine 

present in methylated DNA to cytosine present in 
DNA and its volatilization as methyl groups 
released as methanol ; 

b) determining the presence or minute amount of 
20 volatilized methyl released as methanol groups 

as an indication of cancer in the patient sam- 
ple . 

In accordance with the present invention there 
is provided the use of an antagonist or inhibitor of 
25 DNA demethylase for the manufacture of a medicament for 
cancer treatment, for restoring an aberrant methylation 
pattern in a patient DNA, or for changing a methylation 
pattern in a patient DNA. 

Such an antagonist is a double stranded oligonu- 
30 cleotide that- inhibits demethylase at a Ki of 50nM, 

such as fc"^C™GC"*GC"'Gl . 

tG"'CG"'CG'"CG"'cJ n 

The inhibitors include, without limitation an 

anti-DNA demethylase antibody, an antisense of DNA 

35 demethylase or a small molecule such as any derivative 
of imidazole . 
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The change of the methylation pattern may acti- 
vate a silent gene. Such an activation of a silent gene 
permits the correction of genetic defect such as found 
for p-thalassemia or sickle cell anemia. 

The DNA demethylase of the present invention may 
be used to remove methyl groups on DNA in vitro such as 
needed for cloning DNA. 

The DNA demethylase of the present invention or 
its cDNAs may be used,, for changing the state of dif- 
ferentiation of a cell to allow gene therapy, stem cell 
selection or cell cloning. 

The DNA demethylase of the present invention or 
its CDNAS may be used, for inhibiting methylation in 
cancer cells using vector mediated gene therapy. 

in accordance with the present invention there 
is provided an assay for the diagnostic of cancer in a 
patient, which comprises determining the level of 
expression of DNA demethylase by either RT-PCT, ELISA 
or volatilization assay of the present invention xn a 
sample from the patient, wherein overexpression of the 
DNA demethylase is indicative of cancer cells. 



T>T?TTCTr nTCSCRIPTTON OF TWB DRAWINGS 

Figs. lA to IB illustrate the purification of 
25 demethylase (DNA dMTase) from human A549 cells; 

Figs. 2A and 2C illustrate that DNA dMTase xs a 
protein inhibited by RNA and not by ddCTP, mdCTP; 

Figs. 2B and 2D illustrate the kinetics of DNA 

dMTase activity; 
30 Figs. SA to 3C illustrate the product of DNA 

dMTase activity is cytosine and it exhibits no exonu- 
clease or glycosylase activity; 

Figs. 4A-4C illustrate the demethylation reac- 
tion releases methanol as a volatile residue; 
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Fig. 4D illustrates the transfer of a proton 
from water to regenerate cytosine; 

Figs. 4E-4F illustrate that the volatile product 
is methanol; 

5 Fig. 5 illustrates the suggested demethylat ion 

reaction; 

Figs. 6A-6D illustrate the substrate Specificity 
of DNA dMTase; 

Figs. 7A-7D illustrate chromatographic isolation 
10 of dMTase from human A54 9 cells; 

Figs. 8A-8B illustrate the alignment between the 
MDB domain of MeCP2 and demethylase and the predicted 
amino acid sequence of human demethylase; 

Fig. 8C illustrates the mRNA encoded by demethy- 

15 lase; 

Figs. 9A-9F illustrate the cDNA and their pre- 
dicted amino acid of demethylases and homologues of the 
present invention (SEQ ID N0S:l-8); 

Figs. lOA-B illustrate a mammalian expression 
20 vector of dMTase and in vitro translated dMTase poly- 
peptide; 

Fig. IOC illustrates that in vitro translated 
DNA dMTase releases volatile methyl residues from meth- 
ylated DNA; 

25 Fig. lOD illustrates that in vitro translated 

DNA dMTase transform methylated cytosines to cytosines; 

Fig. IIA illustrates that transiently trans- 
fected demethylase releases volatile residues from 
methylated DNA; 

30 Fig. 11-B illustrates the polypeptide expressed 

from transiently transfected demethylase; 

Figs, lie- HE illustrate that transiently trans- 
fected demethylase transforms methylated cytosines to 
cytosines in a protein dependent manner; 
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Fig. IIF illustrates that the transformation of 
methylated cytosine to cytosine by transiently trans- 
f ected demethylase depends on the concentration of sub- 
strate; 

Fig. 12A illustrates that transiently trans- 
fected demethylase catalyzes the transfer of a proton 
from tritiated water to regenerate cytosine; 

Fig. 12B illustrates that the cloned demethylase 
releases methanol from, methylated DNA; 

Figs. 13A-13C illustrate that the cancer cells 
express demethylase activity whereas normal cells do 
not ; 

Fig. 13D illustrates that demethylase mRNA is 
highly express in cancer cells; 

Fig- 14A illustrates demethylase bacterial ret- 
roviral and mammalian expression vector; 

Fig. 14B illustrates inhibition of demethylase 
activity by a specific inhibitor; 

Fig. 14C illustrates inhibition of tumorigenesis 
in vitro by an inhibition of demethylase; 

Fig. 15 illustrates inhibition of tumorigenesis 
in cell culture by induced expression of demethylase 

antisense vector; 

Fig. 16 illustrates the inhibition of demethy- 
lase by a small molecule inhibitor imidazole; and 

Fig. 17 illustrates a model for the inhibition 
of cancer growth by an inhibition of demethylase. 

DETAILED DESCRIPTION OF TWTC INVENTION 

The pattern of methylation is fashioned during 
development by a sequence of methylation and demethyla- 
tion events. The identity of the demethylase has 
remained a mystery and alternative biochemical activi- 
ties have been shown to demethylate DNA but no activity 
that can truly remove methyl groups from DNA has been 
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shown to date. Utilizing human lung carcinoma cells as 
a source for demethylase activity we demonstrate that 
mammalian cells bear a bona fide DNA demethylase (DMA 
dMTase) activity. DNA dMTase transforms methyl -C to C 
5 by catalyzing replacement of the methyl group on the 5 
position of C with a hydrogen derived from water. DNA 
dMTase demethylates both fully methylated and hemimeth- 
ylated DNA, shows dinucleotide specificity and can 
demethylate mdCpdG sites in different sequence con- 
10 texts. This enzyme is different from previously 
described demethylation activities: it is proteinase 
sensitive, activated by RNase and releases different 
products . 

DNA dMTase is a novel enzyme showing a new and 

15 unexpected activity that has not been previously 
described in any organism. The finding of a bona fide 
demethylase, points out new directions in our under- 
standing of the biological role of DNA methylation. 

In spite of the fact that it was previously 

2 0 shown that Ras expression in pi 9 cells can induce 
demethylation activity. It was not clear whether this 
demethylation activity is indeed a bona fide demethy- 
lase. One would predict that demethylase is present in 
embryonal cells. It was surprising to see that demeth- 

25 ylation activity is present in cancer cells. The find- 
ing of high levels of demethylase in A549 cells is 
indeed an unexpected discovery. 

In accordance with the present invention, it is 
shown and demonstrated that demethylation occurs by 

30 removal of a methyl group from methylated cytosine in 
DNA, that a hydrogen from water replaces the methyl 
group at the 5' position, that the resulting methyl 
group reacts with the remaining hydroxyl from water to 
generate methanol which volatilizes (Fig. 4E-F) . Thus, 
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bona fide demethylation of DNA involves the following 
reaction : 

^ • ^ mMA^4.H OH '^^"'ethYlase H-cytosine + CH3-OH 
CHj-cytosme- (DNA) +H-OH — r « 

The cDNA cloned in accordance with the present 
invention is the demethylase since it can convert 
methyl -cytosines in DNA to cytosines and volatilize the 
methyl groups on DNA when transcribed and translated xn 
vitro which are released as methanol. This is a novel 
CDNA encoding a biochemical activity that has been not 
described before . 

in accordance with the present invention, there is 
shown a model for the inhibition of cancer growth by an 
inhibition of demethylase (Fig. 17) . 



EXPERIMENTAL PROCEDURES 
Cell Culture 

A549 Lung carcinoma cells (ATCC : CCL 185) were 
grown in Dulbecco's modified Eagle's medium (with low 
20 glucose) supplemented with 10% fetal calf serum, 2 mM 
glutamine, 10 U/ml cif rof loxacin. Human Skin Fibro- 
blasts #72-213A MRHF were obtained from BioWhittaker , 
Bethesda and were grown in Dulbecco's modified Eagle's 
medium supplement with 2% fetal calf serum, 2 mM gluta- 
25 mine. H446 Lung carcinoma cells (ATCC: HTB 171) was 
grown in RPMI 1640 medium with 5% fetal calf serum. 
Preparation of nuclear extract 

Nuclear extracts were prepared from A549 cul- 
tures at near confluence as previously described (Szyf 
et al., 1991;- Szvf et al.,1995). The cells were tryp- 
sinized, collected and washed with phosphate-buffered 
saline and suspended in buffer A (10 mM Tris, pH 8.0, 
1 5 mM MgCl,, 5mM KCl, 0.5% NP-40) at the concentration 
of 10« cells per ml for 10 min. at 4«>C. Nuclei were 
collected by centrif ugation of the suspension at 1000 g 
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for 10 minutes. The nuclear pellet was resuspended in 
buffer A (400 /il) and collected as described in the 
experimental procedures. A nuclear extract was* pre- 
pared from the pelleted nuclei by suspending them in 
5 buffer B (20 mM Tris, pH 8.0, 25% glycerol, 0 . 2 mM EDTA 
and 0.4 mM NaCl) at the concentration of 3.3x10° nuclei 
per ml and incubating the suspension for 15 min. at 
4*^0. The nuclear extract was 'separated from the 
nuclear pellet by centrif ugation at 10,000g for 30 min- 

10 utes. Nuclear extract were stored in -80 '^C for at least 
two months without loss of activity. 
Chromatography on DEAE-Sephadex 

A freshly prepared nuclear extract (1 ml , 1.1 
mg) was passed through a Microcon™ 10 0 spin column, the 

15 retainant was diluted to a conductivity equivalent to 
0.2 M NaCl in buffer L and applied onto a DEAE-Sephadex 
column (Pharmacia) (1.0 x 5 cm) that was preequili- 
brated with buffer L (10 mM Tris-HCl, pH 7.5, 10 mM 
MgCl^ ) containing 0.2 M NaCl at a flow rate of 1 

2 0 ml /min. The column was then washed with 15 ml of the 
starting buffer (buffer L + 0.2 M NaCl) and proteins 
were eluted with 5 ml of a linear gradient of NaCl 
(0.2-5.0 M) . 0.8 ml fractions were collected and 
assayed for demethylase activity after desalting 

25 through a Microcon™ 10 spin column (Amicon) and resus- 
pension of the retainant in 0.8 ml buffer L. DNA 
demethylase eluted between 2-5.0 M NaCl. 
Chromatography on S-Sepharose 

Active DEAE-Sepharose column fractions were 

30 pooled, adjusted to 0.1 M NaCl by dilution and loaded 
onto an S-sepharose column (Pharmacia) (1.0 x5 cm) 
which had been preequilibrated with buffer L containing 
0.2 M NaCl at a flow rate of 1 ml/min. Following wash- 
ing of the column as described in experimental proce- 

35 dures, the proteins were eluted with 5 ml of a linear 
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NaCl gradient (0.2-5.0M). 0.5 ml fractions were col- 
lected and assayed for DNA demethylase activity after 
desalting and concentrating to 0 . 2 ml using a Microcon 
10 spin column. DNA demethylase activity eluted around 
5.0 M NaCl. 

Chromatography on Q-Sepharose 

Active fractions from S-sepharose column were 
pooled, adjusted to 0 . 2 M NaCl by dilution and applied 
onto a Q-sepharose (Pharmacia) column (1.0 x5 cm) which 
had been equilibrated as described in the experimental 
procedures at a flow rate of 1 ml/min. The column was 
washed and the proteins were eluted with a linear NaCl 
gradient (0.2- 5.0 M) - Fractions (0.5 ml) were col- 
lected, assayed for demethylase activity after desalt- 
15 ing and concentrating to a final volume of 0 . 2 ml as 
described in the experimental procedures. The demethy- 
lase activity eluted around 4.8-5.0 M NaCl. 
Gel-Exclusion Chromatography on DEAE-Sephacel 

The pooled fractions of Q-sepharose column were 
20 adjusted to 0 . 2 M NaCl, loaded onto a 2 . 0 x 2 . 0 cm 
DEAE-Sephacel column (Pharmacia) and eluted with 10 mi 
of buffer L containing 0.2 M NaCl. The fractions (0.8 
ml) were collected and assayed after concentration to 
about 180 Ml with a Microcon- 10 spin column for DNA 
25 demethylase activity. The activity was detected at 
fraction 4, which is very near the void volume 
(-200kDa) . 

Assay of DNA demethylase activity 

To directly assay DNA demethylase activity xn 

30 vitro two independent methods were applied. 

(A) TO assay the conversion of methyl-dCMP (mdCMP) to 
dCMP we used a previously described method (Szyf et 
ai., 1995). Briefly. a»P labeled, fully methylated 
poly [mdC»PdGl n substrate was prepared as follows. One 

35 hundred ng of a double -stranded fully methylated 
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(mdCpdG) oligomer (Pharmacia) were denatured by boil- 
ing, which was followed by partial annealing at room 
temperature. The complementary strand was extended 
with Klenow fragment (Boehringer Mannheim) using 
5 methyl-5-dCTP (mdCTP, 0 . 1 mM) (Boehringer Mannheim) and 
[a-^^P] GTP (100 fiCi, 3000 Ci/mmol) , and the unincorpo- 
rated nucleotides were removed by chromatography 
through a NAP-5 column (Pharmacia)'. The NAP-5 chroma- 
tography was repeated, to exclude minor contamination 

10 with unincorporated nucleotides. As a control a non- 
methylated poly [dC"pdG] n substrate was similarly pre- 
pared except that a nonmethylated dCpdG oligomer served 
as a template and dCTP was used in the extension reac- 
tion. The column fractions (30 /iD / described in the 

15 experimental procedures were incubated with 1 ng of 
poly [mdC^^pdG] n substrate for 1 hour at 37^C in a 
buffer L containing 25% glycerol (v/v) and 5 mM EDTA. 
The reacted DNA as well as a nonmethylated 
poly [dC^^pdG] n and methylated [mdC'^^pdC] n nonreacted con- 

20 trols were purified by phenol /chloroform extraction and 
subjected to micrococcal nuclease digestion (100 fig at 
10 /il) and calf spleen phosphodiesterase (2/xg) 
(Boehringer) (Pharmacia) to 3' mononucleotides for 15 
hours at 37 ^C. The digestion products were loaded onto 

25 a thin layer chromatography plate (TLC) (Kodak, 13255 
Cellulose) , separated in a medium containing, 132 ml 
Isobutyric acid: 40 ml water: 4 ml ammonia solution, 
autoradiographed and the intensity of the different 
spots was determined using a phosphorimager (Fuji, HAS 

30 2000) . "P labeled substrates and tritium labeled sub- 
strates were phosphoimaged using HAS 2000 plate and 
BAS-TR2 04 0 phosphorimager plate respectively - 
(B) The second method determined removal of methylated 
residues from methylated DNA by measuring disappearance 

35 of ^H-CHj or "C-CH^ from the reaction mixture. 100 ng 
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of poly [dCdG]n double stranded DNA was methylated 
using SssI methylase (New England Biolabs) and an 
excess of ['H-methyl AdoMet (80 Ci/mmol; New England 
Nuclear)]. The tritiated methyl group containing DNA 
was purified from labeled AdoMet using NAP- 5 column 
chromatography. All column purified fractions of DNA 
demethylase were assayed using the tritiated s^lbstrate. 
In a typical assay, 1 ng of DNA 'was incubated (at a 
specific activity of 4 xloMpm/mg) with 30 fil of column 
fraction for one hour at 37 °C in buffer L. To deter- 
mine the number of methyl groups remaining in the DNA 
following incubation with the different fractions, 250 
^1 of water were added and the mixture was incubated at 
65°C for 5 minutes. One hundred fxl of the reaction 
15 mixture were withdrawn for liquid scintillation count- 
ing. Controls received similar treatment except that 
in place of a column fraction, an equal volume of 
buffer L was added. The number of methyl groups that 
were removed from the DNA by the different fractions 
20 was determined by subtracting the remaining counts in 
each of the fractions from the counts remaining in the 
control. All tests were carried out in triplicates. 
The results are presented as picomole methyl group 
removed. One unit of DNA dMTase activity is defined 
25 as: amount of enzyme that releases one picomole of 
methyl group from methylated dCpdG substrate in one 
hour at 3 7°C. 

Methyl removal assay using double- labeled substrates 

TO determine whether the methyl group leaves the 
30 DNA and not 'any non-specific removal of tritium, we 
prepared SK plasmid DNA containing a tritiated hydrogen 
at the 6' position of cytosine and thymidine by growing 
the plasmid harboring bacteria in the presence of deoxy 
[6-'H] Uridine (22 Ci/mmol; Amersham) (lOMCi/ml) . The 
35 [6 -^H] -cytosine containing pBluescript SK( + ) was puri- 
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fied according to standard protocols and was methylated 
using an excess of [''C-methyl] AdoMet (59 mCi/mmol; 
Amersham) (10 fiCi per 100 fxl reaction) and SssI methy- 
lase. The double labeled DNA substrate was purified 
5 twice on a NAP-5 column. 15 fil of DNA dMTase were 
incubated with 1 ng of double labeled DNA (specific 
activity of 2000 dpm/ng) for 1 hour at 37 '^C. Follow- 
ing incubation, the remaining Versus counts were 
determined as described in the experimental procedures 

10 by scintillation counting (Wallac) . The ^^C counts were 
normalized against 'H counts. The controls received 
similar treatment except that instead of DNA dMTase, an 
equal amount of distilled water was added to them. 

To determine the number of ^H-CHj in the gaseous 

15 phase, 1 ng of 'H-CH3 poly [dCpdG] DNA were incubated 
with DNA dMTase overnight in a sealed tube (Pierce, 
Illinois, USA). 0.8 ml of air were removed from the 
tube using a gas tight syringe (Hamilton, Reno, Nevada) 
and injected into a sealed gas tight scintillation vial 

20 containing 10 ml OptiPhase scintillation fluid (Wallac, 
UK) and counted. As a control the DNA was incubated 
with an equal volume of buffer L and treated similarly. 
Synthesis of other methylated dC dinucleotides 

Poly [mdC^'pdA] and [mdC^'pdT] substrates were 

25 prepared as follows. About 0.5 /xg of 20 mer oligonu- 
cleotides 5'(GG)103', 5'(GT)103' and 5'(GA)103' were 
boiled and annealed at room temperature with oligonu- 
cleotide S'CCCCCCB', 5'CACACA3' and 5'CTCTCT3' respec- 
tively. The complementary strand was extended with 

3 0 Klenow fragment using mSdCTP (Boehringer Mannheim) and 
either [a"P] dATP (lOO/xCi, 3000Ci/mmol) or [a"Pl dTTP 
(100 /xCi, 3000 Ci/mmol) respectively. The unincorpo- 
rated nucleotides were removed by chromatography 
through a NAP- 5 column. Hemimethylated mdCpG substrate 

3 5 was prepared in a similar manner except that a nonmeth- 
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ylated poly dCpdG substrate (Boehringer) was used 
template and tnSdCTP and [a"P] dGTP were used for exten- 
sion as described in the experimental procedures. 
Assay for nuclease and glycosylase activity 
5 ["pmdCpdGln substrate which included a labeled 

«P 5' t®^mdC was prepared as follows. About 100 ng of 
poly dCpdG DNA were boiled and partially annealed at 
room temperature. [a"P]dCTP and col'd dGTP were used for 
complementary strand extension as described in the 
10 experimental procedures. The free nucleotides were 
separated using NAP- 5 column chromatography. The puri- 
fied ["pmdCpdGln DNA was subjected to methylation by 
Sssi methylase using 320 nM AdoMet . The DNA was repuri- 
fied twice using a NAP-5 column. The methylated DNA (1 
15 ng) was incubated with either 30 ^1 DNA dMTase. nuclear 
extract or buffer L. To determine whether a"P labeled 
residue is excised from the DNA it was directly applied 
(3Ml) onto a TLC plate. To determine whether the DNA 
was demethylated it was subjected to digestion with 
20 snake venom phosphodiesterase (0.2 mg in a 10^1 reac- 
tion volume) (Boehringer Mannheim) which attacks the 
3' -OH group releasing 5 ' -mononucleotides . The result- 
ing mononucleotides were separated on TLC plates and 
autoradiographed . 
25 TO test whether dCTP copurifies with DNA dMTase. 

which may be involved in activities other than bona 
fide demethylation, 20 fiH of dCTP with 1 /xl of a P 
labeled dCTP (3000 Ci/mmole) was loaded onto the column 
with nuclear extract. The "P counts were measured in 
30 the flow through, washes and in the different frac- 
tions. About 1.1 million counts were loaded onto the 
DEAE-Sepharose column and were all recovered up to 
fraction 8. 

TO determine whether DNA dMTase contains a DNA 
35 polymerase activity, DNA demethylase reactions were 
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performed in presence of 500 /xM of ddCTP (Pharmacia) or 
500 (jM of mSdCTP (Boehringer Mannheim) at initial rate 
conditions . 

To determine whether DNA dMTase is sensitive to 
5 RNase or Proteinase K treatment, DNA dMTase was pre- 
treated for 1 h at 56^C with 200 /zg/ml proteinase K 
(Sigma) . A demethylation reaction was carried out with 
this pretreated fraction in the usual manner using both 
demethylation assays described in the experimental pro- 

10 cedures. To test the effect of RNA digestion on the 
demethylation reaction, the fractions from different 
columns were treated with 100 fig/ml RNase A (Sigma) . 
Demethylation of pBluescript SK( + ) Plasmid 

About 4 fig plasmid pBluescript SK (Stratagene) 

15 was subjected to methylation using SssI methylase. The 
methylated plasmid (4 ng) was incubated with 30 /xl of 
DNA dMTase Fraction 4 of DEAE-Sephacel column under 
standard conditions, extracted with phenol: chloroform 
and precipitated with ethanol . About 1 ng of the plas- 

20 mid were subjected to digestion with 10 units each of 
either of the restriction endonucleases EcoRII (GIBCO- 
BRL) , Dpnl, Hhal or Hpall (New England Biolabs) before 
and after methylation as well as after DNA dMTase 
treatment in a reaction volume of 10 /il for 2 hour at 

25 37 °C. Following restriction digestion the plasmids 
were extracted with phenol : chloroform, ethanol precipi- 
tated and resuspended in 10 [il . The plasmids were 
electrophoresed on a 0.8% (w/w) Agarose gel, trans- 
ferred onto a Hybond Nylon membrane and hybridized with 

30 pBluescript SK'( + ) plasmid which was "P labeled by ran- 
dom-priming (Boehringer Mannheim) . 

Effect of Redox Reagents (NAD^ NADH, NADP, NADPH and 
FeClj) on demethylase activity 

The reagents were prepared at 100 /iM concentra- 
35 tion and added at a final concentration of 10 //M to a 
standard methyl removal assay under initial rate condi- 
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tions as described in the experimental procedures. The 
methyl removal activity in presence of each of the 
cof actors was compared to a control DNA dMTase reac- 
tion. 

5 Determination of kinetic parameters 

For determination of kinetic parameters, the 
demethylation reactions were performed using both 
assays (generation of dCMP and removal of methyl) as 
described in the experimental procedures except that 
10 varying DNA concentrations from 0.1 nM to 2.5 nM were 
used in a total volume of 50^1 including 30 ^1 of DNA 
dMTase. Since it has been established by previous 
experiments that the reaction proceeds for at least 3 
hours, the initial velocity of reaction was measured 
at one hour intervals. The velocity data was collected 
at each substrate DNA concentration range stated for 
both assays. The Km and Vmax values for DNA demethy- 
lase activity were determined from double reciprocal 
plots of velocity versus substrate concentration. 
20 Measurements of methanol production catalyzed by 
demethylase by gas chromatography 

Gas chromatography was performed with a Varian™ 
model 3400 GC equipped with a 30m Stabilwax™ column 
(0 053 cm i.d.: Restek Corporation). Nitrogen™ was 
25 used as carrier gas at a flow rate of 32 ml/min, the 
injector and detector chambers were at 200 and 300°C 
respectively. The column was maintained at 40°C for 5 
minutes after sample injection. 

The demethylase reaction was performed in eppen- 
30 dorf tubes kept within sealed scintillation vials with 
300 Hi of water as aqueous phase (in radioactive trap- 
ping experiments this was replaced by 300 ^il of metha- 
nol) . The demethylase reaction was initiated in buffer 
L (10 mM MgCl,, 10 mM Tris-HCl pH 8.0) with 500 ng of 
35 tritiated SK plasmid (6000 dprn/^il) and 100 ^il ^of 
demethylase at 37°C. After overnight incubation at 37°C, 
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the aqueous phase surrounding the eppendorf tube was 
transferred to a fresh eppendorf tube, 2 [xl of this 
mixture was injected in the gas chromatography using a 
gas tight syringe (Hamilton, Reno, Nevada) . 
5 Coupled in vitro transcription translation 

The mRNAs encoded by the pcDNA 3 . l/His Xpress 
demethylase constructs described above were transcribed 
and translated by coupled transcription- translation 
using Promega*^^ TNT re.ticulocyte lysate kit (according 

10 to manufacturer's protocol), 2 fig of each construct and 
4 0MCi of [^^-S] methionine (1 , OOOCi/mmol , Amersham) in a 
50/xl reaction volume. To purify non labeled in vitro 
translated demethylase, coupled in vitro transcription 
and translation was performed as above but in the pres- 

15 ence of cold methionine. The translation products were 
bound to a Probond™ nickel column (Invitrogen) and 
demethylase was eluted according to the manufacturer's 
protocol with increasing concentrations of imidazole. 
Demethylase is eluted at 350-500mM imidazole. The imi- 

20 dazole eluted demethylase was dialyzed and concentrated 
by 1 yophi 1 i za t i on . 

Gas chromatography coupled with Mass spectrometry (GC- 
MS) Analyses for identification of volatile product of 

25 demethylase catalyzed reaction as methanol 

The demethylation reactions (volume 50 1) were 
run in conical vials having a total internal volume of 
350 microlitres. The vials were closed with a teflon- 
lined screw cap and left at room temperature for 18 h. 

3 0 The vials were cooled in an ice bath, opened and 10 mg 
of NaCl and 50 microlitres of toluene were added. The 
vials were frequently shaken over a period of 1 h. The 
toluene phases were pipetted into clean vials in a man- 
ner to rigorously exclude water carry over. Anhydrous 

35 sodium sulfate (5 mg) was added to the toluene extracts 
to remove water, and the toluene phases were pipetted 
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into autoinjector vials for GC/MS analysis. Aliquots 
of 3 microlitres were analyzed under the following 
instrumental conditions: Instrument: Hewlett-Packard 
5988A; Column: 30 m x 0.25 mm i.d. fused quartz capil- 
5 lary with 0.25 micron DB-1 liquid phase, programmed 
after an initial hold for 1 min at 70 deg at 5 deg/min 
to 80 deg, then ramped ballistically to 280 deg for 
bake-out for 5 min; Injector and interface tempera- 
tures: 250 deg; Helium flow rate 1.5 ml/min; Mass 
10 spectrometer: ion source 200 deg, 70 eV electron impact 
ionization, scanning from m/z 10 to 50 in full scan 
mode was begun 6 s after injection, and ceased at 1 . 5 
min to avoid acquisition of the intense toluene solvent 
peak. 

Human A549 cells bear a demethylase activity that could 
be purified away from dCTP and DNA MeTase 

The use of an appropriate cellular source and a 
direct assay for demethylase activity are obviously 
critical. AS we have previously shown that demethylase 
activity was induced in response to ectopic expression 
of the Ras oncogene (Szyf et al., 1995) we reasoned 
that cancer cells might bear high levels of demethy- 
lase activity. Based on preliminary studies demon- 
25 strating the presence of high levels of demethylase 
activity in the human lung carcinoma cell line A549, we 
have chosen this cell line for our further studies and 
purification steps. Previous studies have used indi- 
rect measures such as increased sensitivity to methyla- 
30 tion-sensitive restriction enzymes as indicators of 
demethylase activ ty (Weiss et al . , 1996; Jost et al . , 
1995) . To directly measure the conversion of 5-mdCMP 
in DNA to dCMP, we have utilized a completely methyl- 
ated "P labeled [mdC"pdG] n double stranded oligomer 
35 which we had previously described (Szyf et al . . 1995). 
Following incubation with the different fractions, the 
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DNA is purified and subjected to cleavage with microco- 
cal nuclease to 3' mononucleotides. The 3' labeled 
mdCMP and dCMP are separated by thin layer chromatogra- 
phy (TLC) and the conversion of mdCMP to dCMP is 
directly determined. This assay provides a stringent 
test for bona fide demethylation and discriminates it 
from previously described BmCpC replacement activities 
(Jost et al., 1995; Weiss et al.,' 1996). The glyco- 
sylase-demethylase activity described by Jost et al . 
(Jost et al., 1995) will require the presence of a 
ligase activity and an energy source for replacement of 
mdC with C to be detected by our assay, whereas the 
demethylase activity described by Weiss et al . will not 
be detected since it replaces the intact mdC^^pdG dinu- 
15 cleotide with a cold dCpdG without altering its state 
of methylation (Weiss et al . , 1996). 

Nuclear extracts were prepared from A54 9 cells, 
applied onto a DEAE-Sephadex column, eluted with a lin- 
ear gradient from 0.2-5.0M NaCl and the fractions were 
20 assayed for demethylase (dMTase) activity as described 
in the experimental procedures. As shown in Fig. 1(A) 
a clear peak of dMTase activity is eluted at the high 
salt fraction 10. 

Conversion of methylated cytosine to cytosine: 
25 Nuclear extracts prepared from A549 cells (1-1 mg) were 
passed through an AMICON™ 100 spin column. The retain- 
ant (98.56 mg, 0.2 mg/ml) was loaded onto a DEAE-Sepha- 
rose column, the different chromatographic column frac- 
tions eluted by a linear NaCl gradient (0.2-5M) were 
30 desalted and '(30 fxl) incubated with 1 ng of [mdC'^pdG] n 
double stranded oligomer for 1 hour at 37 "C, digested 
to 3' mononucleotides and analyzed on TLC as described 
in the experimental procedures. Control methylated 
(ME) and nonmethylated (NM) [dC''pdG]n substrates were 
35 digested to 3' mononucleotides and loaded on the TLC 
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plate to indicate the expected position of dCMP and 
mdCMP. The active fraction is indicated by an arrow. 
This fraction was loaded on S-Sepharose followed by Q- 
Sepharose and DEAE-Sephacel fractionation. 

The first chromatography step purified the 
dMTase activity from the bulk of nuclear protein 
(Fig. IB) and is a very effective purification step. 

DNA dMTase activity as measured by the release 
of volatile methyl residues. The different column 
fractions were incubated with Ing (4 x 10* dprn/^g) of 
[^H] -CHj- [mdCpdG]n oligomer and the release of volatile 
methyl residues was determined (-) and presented as 
total dpn) . The results are an average of three inde- 
pendent determinations. Protein concentration was 
15 determined using the Bio-Rad Bradford kit (-). The 
elution profile of 20 of t"P] -a-dCTP incubated with 
the protein was determined by scintillation counting of 
the different DEAE fractions (-) and presented as frac- 
tion of dCTP loaded on the column. 

To exclude the possibility that the DNA dMTase 
activity detected in our assay is carried by the DNA 
MeTase, we assayed the fractions for DNA MeTase activ- 
ity using a hemimethylated DNA substrate as previously 
described (Szyf et al . , 1991). As observed in Figure 
25 IB DNA MeTase activity is detected in the second and 
third fractions, thus our fractionation separated DNA 
dMTase away from the DNA MeTase suggesting that they 
are independent proteins . 

There is a remote possibility that the demeth- 
30 ylation observed is not a Jbona fide demethylation but 
a consequence of a glycosylase removal of mC, followed 
by removal of the remaining deoxyribose -phosphate by AP 
(apyrimidine) nuclease, repair of the gap catalyzed by 
DNA polymerase using trace dCTP contained in the frac- 
35 tion and ligation of the break with ligase in the pres- 
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ence of residual ATP. For this hypothesis to be con- 
sistent with our data, four independent enzymes and two 
cof actors have to cof ractionate with DNA dMTase . To 
exclude the possibility that a trace amount of dCTP is 
5 bound to DNA dMTase active fraction, we have added 20 
fiM of "P labeled dCTP (10x10^ cpm) to the nuclear 
extract and determined its elution profile on the DEAE 
column. Less than background cpm (10 cpm) were 
detected in the DNA dMTase active fraction suggesting 

10 that our first column purifies dCTP away from the DNA 
dMTase at least 1x10* fold (Fig. IB) . If any dCTP is 
present in the nuclear extract, the remaining concen- 
tration after fractionation on DEAE is well below the 
Kms of the known DNA polymerases. The possibility that 

15 dCTP is so tightly bound to the enzyme that it could 
not be replaced by the exogenous "P labeled dCTP is 
very remote since an enzyme using dCTP as substrate 
must readily exchange dCTP. 

The active fraction 10 was further fractionated 

20 sequentially on the following columns: S-Sepharose and 
Q-Sepharose. The DNA dMTase eluted at the high salt 
fraction from both columns as determined by the 
[mdC"pdG]n demethylation assay (Fig. lA) . The ion 
exchange chromatography was followed by chromatography 

25 on DEAE-Sephacel . 

The fact that we have maintained our activity 
even after 4 fractionation steps (Table 1) and that 
only a single polypeptide is apparent after the last 
purification step argues strongly against the possibil- 

30 ity that the a-ctivity detected in our study is a repair 
or replacement activity. Any replacement mechanism 
must involve a number of proteins and additional cofac- 
tors and substrates. In summary, the chromatography of 
the demethylase activity in A459 cells provides strong 
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support to the hypothesis that manunalian cells bear a 
Jbona fide demethylase activity. 
DNA dMTase releases a volatile derivative 

A Jbona fide demethylation has to result in 
release of the methyl group as a volatile derivative 
such as CO,-, methanol, methane or formaldehyde. We 
have therefore incubated a { [^H] -CH3-dCpdG}n double 
stranded oligonucleotide with the different colunua 
fractions and the rate of release of the tritiated 
methyl from the aqueous phase was determined by scin- 
tillation counting of the remaining radioactivity in 
the reaction mix. As demonstrated in Fig. lb (dia- 
mond), the dMTase active fractions release labeled 
methyl groups from the methylated substrate. 

DNA dMTase is a protein which is i-^^^^^^^^^/^^f'^;^t^e 
not involve an exchange activity and does not require 

additional cofactors 

DNA dMTase activity measured either as transfor- 
mation of mdC to C (Fig. 2a) or as release of volatile 
methyl residues (Fig. 2c) is abolished after proteinase 
K treatment and is not inhibited but rather enhanced 
following RNase treatment. 500 of ddCTP which 

inhibits DNA polymerase does not inhibit demeth- 

ylation of the [mdC32pdG]n substrate, nor is it inhib- 
ited by high concentrations of methyl-dCTP (500 ^M) 
(Fig. 2a), which is consistent with the hypothesis that 
demethylation does not involve an excision and replace- 
ment mechanism. If a replacement mechanism is involved 
in demethylation. the presence of mdCTP should result 
in incorporation of methylated cytosines and essential 
inhibition of demethylation. Thus, the DNA dMTase 
identified here is a protein and not an RNA and is une- 
quivocally different from the previously published RNA 
35 based or glycosylase based demethylase activities. 
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The DNA dMTase reaction proceeds without any 
requirement for additional substrates such as dCTP, 
redox factors such as NADH and NADPH or energy sources 
such as ATP (data not shown) . As observed in Fig. 2b 
5 and 2d, the DNA dMTase reaction maintains its initial 
velocity up to 90 minutes and continues up to 120 min- 
utes. This time course is inconsistent with dependence 
on enzyme-bound additional nonrepl'enishable substrates 
such as dCTP or ATP or a nonreplenishable redox factor 
10 such as NADH or NADPH. Exhausting the nonreplenish- 
able substrate or redox factor would have resulted in 
rapid deceleration of the initial velocity. 

A product of the demethylation reaction is deoxyCyto- 

15 sine in DNA 

What is the product of the demethylation reac- 
tion? The results presented above (Fig. la, 2a and b) 
based on a one dimension TLC separation show that DNA 
dMTase generates dC from mdC in DNA. To further sub- 

20 stantiate this conclusion, we subjected DNA dMTase 
treated DNA to remethylation with the CpG MeTase M.Sss 
I which can transfer a methyl group exclusively to dC . 
The results presented in Fig. 3a show that the demeth- 
ylated product of DNA dMTase is dC since it is com- 

25 pletely remethylated with M.Sss I. The identity of the 
demethylated product as dC was further established by a 
two-dimension TLC analysis demonstrating that the prod- 
uct of dMTase comigrates with a cold dCMP standard in 
both dimensions (Fig. 3b). 

3 0 DNA dMTase does not release a nucleotide, a 

phosphorylated base or phosphate from methylated DNA 
when incubated with a [32pmdCpdGln substrate which 
included a labeled 32P 5' to mdC or our standard meth- 
ylated substrate (Fig.l) where 32P is 3' to the mSdC 

35 (Fig. 3c). Nuclear extracts which obviously contain a 
number of glycosylases and nucleases release phospho- 
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rylated derivatives in the same assay (Fig. 3c). 
dMTase transforms the methyl cytosine in the 
[32pmdCpdG]n substrate to cytosine as demonstrated 
when the reacted DNA is digested to 5' mononucleotides 
5 (Fig 3c +V PDS) and analyzed by TLC . Since this 

reaction does not involve release of a 32P derivative 
(Fig 3c -V PDS), it demonstrates that dMTase trans- 
forms methylated cytosines to cytokines on DNA without 
disrupting the integrity of the DNA substrate by glyco- 
10 sylase or nuclease activity . 

The second product of the dMTase reaction is methanol 

What is the identity of the leaving group? The 
results presented in Figlb suggest that the labeled 
15 methyl leaves the DNA as a volatile compound. The 
demethylase reaction involves release of the methyl 
group per se whereas the cytosine base ring remains in 
the aqueous phase. Fig. 4a demonstrates this point by 
using a methylated plasmid labeled with a ^H-hydrogen 
20 at the sixth position of cytosine and [14C] -methyl at 
the fifth position of cytosine as a substrate. 

The three most obvious candidates the methyl 
group is leaving as are formaldehyde, carbon dioxide, 
and methanol. Methadone trapping for labeled formalde- 
25 hyde detection and sodium hydroxide trapping for 
labeled carbon dioxide detection were both negative in 
identifying the form in which the methyl group is leav- 
ing in the dMTase reaction (data not shown) . The other 
possible chemical form that the methyl group may leave 
30 the DNA as, is methanol. Since methanol is a volatile 
compound, a 'simple method to measure generation of 
methanol is a scintillation-volatilization assay (see 
Fig 4b for description). Volatilization assays have 
been previously used to measure release of methanol m 
35 demethylation reactions. The demethylation reaction 
mix containing the labeled { [^H] -CH^-dCpdOn substrate 
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with either dMTase or no enzyme, as a control, is added 
to an uncapped 0 . 5 ml tube which is placed in a sealed 
scintillation vial containing scintillation fluid. 
Released methanol is volatile, diffuses out of the open 
5 reaction tube and is mixed with the excess of the scin- 
tillation fluid in the vial registering as counts in 
the scintillation counter. As a control indicating 
that methanol is volatilized undei: the conditions of 
our assay, we incubated approximately equal counts of 

10 radioactively labeled methanol under the same condi- 
tions and measured the counts in a scintillation coun- 
ter at different time points. As observed in Fig. 4c 
the majority of methanol in the reaction tube volatil- 
izes from the reaction tube into the scintillation 

15 fluid following an overnight incubation at 37°C. The 
experiment shown in Fig. 4b demonstrates that volatil- 
ized label is released from methylated DNA only in the 
presence of dMTase . 

The identity of the volatile group has been 

20 determined to be methanol by a gas chromatography (GC) 
analysis. The demethylat ion and control reactions 
(indicated in Fig. 4e) were performed in an uncapped 
tube placed in a sealed scintillation vial containing a 
larger volume (300/xl) of water. The volatile residue 

25 diffuses into the surrounding water and mixes with it. 
A 2 [il sample of the surrounding water was injected 
into a GC column as described in the methods. As 
shown in Fig. 4e, the volatile compound released by 
dMTase in a dose response manner coelutes with metha- 

30 nol. Release of methanol is observed only in the pres- 
ence of both dMTase and methylated DNA. No methanol is 
released when dMTase is reacted with nonmethylated DNA, 
demonstrating that methanol is a product of demethyla- 
tion of DNA. 
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The leaving group was also identified as metha- 
nol using gas chromatography coupled with Mass spec- 
trometry (GC-MS) . AS illustrated in Fig. 4f., incuba- 
tion of methylated DNA with dMTase (dMTase+ME-DNA) 
5 results in release of a peak with the retention time 
and mass spectrum (peaks are identified at 32 and 29 
atomic mass which are the atomic masses of methanol and 
ionized methanol respectively) whidh is consistent with 
its identification as . methanol . Incubation of dMTase 
10 with nonmethylated DNA does not release methanol indi- 
cating that methanol is a product of the demethylation 
reaction. No methanol is released when the samples are 
incubated with dMTase treated with protease K indicat- 
ing that the release of methanol from methylated DNA is 
15 catalyzed by an enzymatic activity. 

Demethylation involves transfer of a hydrogen from 
water to regenerate cytosine 

If demethylation involves removal of the methyl 
moiety from mdC, a hydrogen has to be transferred to 
the carbon at the 5' position to regenerate cytosine. 
Since no redox factors are involved, what is the source 
of the hydrogen? To test the hypothesis that the 
source of the hydrogen is water, we incubated either 
non labeled tmdCpdG] n or [dCpdG]n double stranded DNA 
with DNA dMTase for different time periods in the 
presence of tritiated water, following which the DNAs 
were digested to 3' dNMPs, separated on TLC with non- 
radioactive standards for each of the 5 possible dNMPs 
and exposed to a tritium sensitive phosphorimaging 
plate. As seen in Fig.4d, dMTase catalyzes the trans- 
fer of a tritiated hydrogen from water to dCMP in meth- 
ylated DNA in a time dependent manner only when meth- 
ylated DNA is used as a substrate. Based on the 
35 experiments described in Fig. 3 and 4 we propose that 
dMTase catalyzes the exchange of the methyl group at 
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the 5' position of cytosine in DNA with hydrogen from 
water and the methyl group reacts with the remaining 
hydroxyl group to form methanol (Fig. 5) . 

5 Substrate and sequence specificity of DNA dMTase 

Methylation of CpG dinucleotides is the most 
characterized modification occurring in genomic 
DNA8,48. The results presented in Fig . 6 demonstrate 
that DNA dMTase is a general DNA dMTase activity that 

10 demethylates fully or hemimethylated dCpdG in DNA 
flanked by a variety of sequences which are distributed 
at different frequencies, but does not demethylate 
methylated adenines or methylated cytosines that do not 
reside in the dinucleotide CG. First, as shown in 

15 Fig. 6a, a plasmid DNA methylated in vitro at all dCpdG 
sites with M.Sss I and all d*CdCdGdG sites with M. Msp 
I (which methylates the external C in the sequence 
*CCGG, thus enabling the determination of demethylation 
at the CC dinucleotide) and in vivo with the E. coll 

20 DCM MeTase at dCmdCdA/dTdGdG sites and with the DAM 
MeTase at dGmdAdTdC sites (adenine methylated) was 
treated with dMTase and the state of methylation of the 
plasmid was determined using the indicated methylation 
sensitive restriction enzymes. dMTase demethylates C*G 

25 methylated sites as indicated by the sensitivity of the 
dMTase treated plasmid to Hpa II and Hha I but does not 
demethylate C*C,C*A or C*T methylated sites as indi- 
cated by the resistance to Msp I and Eco RII restric- 
tion enzymes, or adenine methylation as indicated by 

30 its sensitivity to Dpn I. Second, bisulfite mapping 
analysis of methylation of 5 methylated C*G sites 
residing in a M.Sss I in vitro methylated pMetCAT plas- 
mid following dMTase treatment shows that all C*G sites 
are demethylated irrespective of their flanking 

35 sequences thus excluding the possibility that demeth- 
ylation is limited to CCGG or CGCG sequences (Fig. 6b) . 
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Third, dMTase does not demethylate two fully methylated 
cytosine bearing oligomers [dmC32pdA] n, [mdC3 2pdT] n 
demonstrating that mdCpdA and mdCpdT are not demethyl- 
ated by DNA dMTase (Fig. 6d) . Fourth, dMTase demethyl- 
- ates a hemimethylated synthetic substrate 

[dCpdG]n*[mdC32pdG]n (Fig. 6d) . Demethylation of SK is 
complete under these conditions (Fig. 6a) whereas 
demethylation of a methylated [mdCpdG] n substrate is 
not complete under the . same conditions (Fig. 6d) . This 
0 can reflect differences in the sequence composition of 
the substrate and the frequency of methylated cyto- 
sines The [mdCpdG] n contains on average 16 fold more 
methylated cytosines per molecule than plasmid DNA. 
Alternatively, these differences might reflect discrep- 
5 ancies in the assays used, restriction enzyme digestion 
versus a nearest neighbor analysis. To address this 
discrepancy we have labeled a fully methylated SK plas- 
mid with [a"P]dCTP, 5-methyl-dCTP and the other dNTPs, 
subjected it to dMTase treatment and digested it to 
20 mononucleotides at different time points following the 
initiation of the reaction and subjected the samples to 
a TLC analysis. As shown in Fig. 6c, the SK plasmid is 
fully demethylated at 3 hours which is consistent with 
the results obtained with methylation sensitive 

25 restriction enzymes (Fig. 6a). , ^ 

The Km of DNA dMTase for hemimethylated and 
fully methylated DNA was determined by measuring the 
initial velocity of the reaction at different concen- 
trations of substrate (Table 2) . The calculated Km for 
hemimethylated DNA is 6 nM which is two fold higher 
than the Km for DNA methylated on both strands, 2.5-3 
nM (Table 2). It is unclear yet whether this small 
difference in affinity to the substrate has any sig- 
nificance in a cellular context. Thus similar to the 
35 DNA MeTase DNA dMTase shows dinucleotide sequence 
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selectivity but in difference from DNA MeTase which 
shows preference to hemimethylated substrates dMTase 
prefers fully methylated DNA which is consistent with a 
role for DNA dMTase in altering established methylation 
5 patterns. 

Table 1 



Purification of DNA dMTase 



Purification step 


Total 
protein . 

iMQ) 


Total dpm 


pMole/pg 


pMole/|jg/h 


Fold 
Purification 


Nuclear extract 


6000 


1107.2 


5.5 X 10 ^ 


1.833x10'^ 




DEAE-Sephadex 


3.75 


5844 


0.4674 


0.156 


8445.5 


SP-Sepharose 


0.77 


5106 


1.989 


0.663 


35939.84 


Q-Sepharose 


0.46 


5335 


3.4 


1.13 


62860.65 


DEAE-Sephacel 


0.018 


1834 


30.57 


10.19 


552243.2 



10 Table 2 



Kinetic 


parameters for DNA 


dMTase 


Method 


K„ (DNA) 


V_ (pMole/h) 


Methylated oligo CpG 


2.5 nM 


340 


Hemi-methylated CpG 


6.0 nM 


402 


Methylated SK-DNA 


3.3 nM 


40.42 



Cloning and construction of demethylase expression 
vectors 

15 PGR amplification of the MBD domain of the putative 
demethylase candidate cDNA 

One fig of total RNA prepared from the human 
small lung carcinoma cell line A54 9 was reverse tran- 
scribed using Superscript reverse transcriptase and 
20 random primers (Boehringer) in a 25 ^1 reaction volume 
according to conditions recommended by the manufacturer 
(GIBCO-BRL) . Five fil of reverse transcribed cDNA were 
subjected to an amplification reaction with Taq poly- 
merase (Promega, 1 unit) using the following set of 
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primers: sense 5 ■ CTGGCAAGAGCGATGTC 3 • SEQ ID NO : 9 , 
antisense 5 ' AGTCTGGTTTACCCTTATTTTG 3" SEQ ID NO: 10-^ 

Amplification conditions were: step 1. 95°C 1 
min.; step 2: 94°C 0.5 min; step 3: 45°C 0.5 min.; step 
4- 72°C 1.5 min; steps 2-4 were repeated 30 times. 
MgCl, was adjusted to 1 mM according to conditions rec- 
ommended by the manufacturer. The PGR products were 
cloned in pCR2 . 1 vector (InVitrogen) and the sequence 
of the cDNAs was verified by dideoxy-chain termination 
method using a T7 DNA sequencing kit (Pharmacia) . The 
amplified fragment was excised from the plasmid wxth 
EcoRI, labeled with a Boehringer random prime labeling 
kit according to manufacturer's protocol and alpha P- 
dCTP The label,®d- probe was used to screen a HeLa eel 
cDNA library in XTriplEx phage (Clontech) according to 
standard procedures. Positive clones were identified 
and further purified by serial dilutions ^^^^^^ 
The insert in the pTriplEx plasmid was excised from the 
phage according to manufacturer's protocols and the 
identity of the insert was verified by sequencing . The 
insert was excised by NotI restriction and subcloned 
into either the inducible expression vector: Retro tet 
on (Clontech) in the sense and antisense orientation or 
the pcDNA3.l/His Xpress vector in all three frames and 
in the antisense orientation. 

Transfection and expression of demethylase in verte- 
brate cells 

Ten H9 of either Retro tet on demethylase or 
pcDNA 3.1/His xpress demethylase are mixed with 8 m1 of 
transfection- lypophilic reagent Pfx-2 ^^--^-^^^^^^/^^ 
placed upon 100,000 mouse (3T3 Balb/c, human (A549) or 
monkey cells (CV-1) according to manufacturer's proto- 
col in OPTIMEM medium for 4 hours. Cells are harvested 
, after 48 hours and demethylation and demethylase activ- 
ity is determined by measuring total genomic DNA meth- 
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ylation using standard techniques or a cotransf ected in 
vitro methylated plasmid using a Hpall /Mspl restric- 
tion enzyme analysis. Cellular transformation is meas- 
ured by a soft agar assay. 

5 

Demethylation of pBluescript SK{+) Plasmid 

About 4 ng plasmid pBluescript SK (Stratagene) 
was subjected to methylation using ,SssI methylase. The 
methylated plasmid (4 ng) was incubated for different 

10 time points as indicated with 30 /il of DNA dMTase 
Fraction 4 of DEAE-Sephacel'^'^ column under standard con- 
ditions, extracted with phenol: chloroform and precipi- 
tated with ethanol. About 1 ng of the plasmid were 
subjected to digestion with 10 units each of either of 

15 the restriction endonuclease EcoRII (GIBCO-BRL) , Dpnl , 
or Hpall (New England Biolabs) before and after meth- 
ylation as well as after DNA dMTase treatment in a 
reaction volume of 10 /il for 2 hour at 3 7°C. Following 
restriction digestion the plasmids were extracted with 

20 phenol: chloroform, ethanol precipitated and resuspended 
in 10 fil. The plasmids were electrophoresed on a 0.8% 
(w/w) Agarose gel, transferred onto a Hybond™ Nylon 
membrane and hybridized with pBluescript SK(+) plasmid 
which was "P labeled by random -priming (Boehringer 

25 Mannheim) . 

dMTase activity coelutes with a -45 KDa polypeptide 
when sized iinder denaturing conditions but migrates as 
a higher molecular weight complex under non denaturing 

30 conditions. dMTase was purified up to 500,000 fold by 
four chromatographic steps (Table 1} - we first deter- 
mined the identity of the polypeptide associated with 
dMTase activity by SDS-PAGE analysis of the active 
fractions. As observed in Fig. 7a, a cluster of 4 

35 polypeptide bands from -44 KDa to 35 KDa coelute with 
dMTase activity in the last two chromatographic steps 
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(the lower fragment might be a degradation product as 
evidenced by its abundance in the later chromatographic 
steps) . However when the active DEAE-Sephacel fraction 
is size fractionated on a 4% non denaturing acrylamide 
5 column, the dMTase activity elutes at the high molecu- 
lar weight of -170 KDa (Fig. 7c, fraction 63). SDS- 
PAGE analysis of this fraction (63) reveals only two 
bands (Fig. 7b) observed in the active chromatographic 
fractions (Fig- 7a) . To further determine whether 
10 dMTase is found in a multimeric complex, fraction 63 
was size fractionated on a glycerol gradient (Fig- 7d) 
and DNA dMTase activity eluted at the -170 kDa range. 
AS only two main small polypeptides were identified m 
fraction 63 (approximately 35-43 KDa) , dMTase is proba- 
bly found in either a homomeric complex if only one of 
the two peptides is dMTase or a heteromeric complex if 
both polypeptides are associated with dMTase activity. 

a. Identification of a lead DNA dMTase candidate by 
homology search of dbEST 

As the purification of dMTase suggests that tne 
dMTase is of very low abundance, only -19 ng of dMTase 
could be isolated from 6 mg of nuclear extract 
(Table 1) , we opted for cloning the dMTase based on its 
following functional properties. First, since dMTase 
specifically demethylates methylated CG dinucleotides , 
we assumed that it should bear the ability to recognize 
methylated CG dinucleotides. Second, the demethylase 
transforms methylated cytosine in DNA to cytosme. 
Third, the demethylase releases the methyl group as a 

volatile compound. 

previous reports have shown that proteins inter- 
acting with methylated DNA share a common domain 
(MDBD) . A TBLAST^ search of the dbEST database identi- 
fied a novel expression tag cDNA (from a T-cell lym- 
phoma Homo sapiens cDNA 5' end) (gb/AA361957/AA361957 
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EST71295) and the mouse homologue ( (gb/W97165/W97165 
mf90g05.rl) from Soares mouse embryo NbME13.5) with 
unknown function that bears homology to the MDBD 
(Fig. 8a) . A search of the GenBank database verified 
5 that it is a novel cDNA that has not been included in 
GenBank. Alignment of the novel EST and MeCP2 and 
MeCPl associated protein has revealed no homology 
beyond the previously characterized' MDBD which is con- 
sistent with a different function for this methylated 

10 DNA binding protein. A 201bp fragment bearing the 
sequence identified in the search was reverse tran- 
scribed and amplified from human lung cancer cell line 
A54 9 RNA and was used to screen a cDNA library from 
Hela cells. The largest insert cloned was of 1.36 kb 

15 size and its sequence identity with the EST sequence 
was determined- The cDNA is novel and has no homologue 
in GenBank and no function has ever been assigned to 
it. A virtual translation of the protein identified an 
open reading frame (ORF) of 262 amino acids (Fig. 8b) . 

20 The ORF may extend further 5' as no in frame stop codon 
was found upstream of this ATG. However, RACE analy- 
ses and further searches of the dbEST have failed to 
identify 5' sequences upstream to the one identified in 
our screening. 

25 A BLAST search of the candidate protein using 

the Predict protein server against a database of pro- 
tein domain families has identified only the MDBD 
domain and found no homologue to the sequence in the 
data base search. No other functional motifs were 

30 identified by the Prosite analysis. This is consistent 
with a novel biochemical function for this protein. A 
coiled coil prediction of the sequence identified a 
coiled coil domain which is known to play a role m 
protein protein interactions. 
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The identified cDNA encodes an mRNA that is 
widely expressed in human cells as revealed by a North- 
ern blot analysis of human poly A. mRNA (Fig. 8c) as 
one major transcript of - 1 • 6 kb which is close to the 
5 Size of the cloned cDNA, verifying that the cloned cDNA 
does not represent a highly repetitive RNA but rather a 
nO^A encoded by a single or low copy number gene, 
in vitro translated candidate cDNA bears dMTase activ- 

''"^ A conclusive proof for the existence of a single 
protein that bona fide demethylates DNA is to demon- 
strate that an in vitro translated candidate cDNA can 
volatilize methyl groups from methylated DNA and trans- 
15 form a methyl cytosine to cytosine in an isolated sys- 
tem The candidate dMTase cDNA was subcloned xt xnto a 
pcDNA3.l/His Xpress (INVITROGEN) expression vector xn 
the putative translation frame (pcDNA3.1His A) and xn a 
single base frame shift (pcDNA3.1His B) , and was xn 
20 vitro transcribed and translated in the presence of 
3^S-methionine and the resulting translation products 
were resolved by SDS-PAGE. Autoradiography revealed a 

/i?-;^ -xn^) The apparent size of the in 
«40KDa protein (Fig. lOa; . me cippc* 

Wtro translated protein is shorter by -3-S KBa £ro™ 
.5 the apparent size of the purified protein. The cloned 
ODNA might be missing some upstream amino ac.ds as dis- 
cussed above or might be differently rrodified in hu^an 

ceX3>s ' 

TWO tests established whether the in vitro 

30 translated candidate cDNA is a ^ona fide 

first tested ■ Whether In vitro translated protein 
(purified on a Ni2. charged agarose resin) can volatil- 
ize and release methyl residues in I'Hl -CH.-Dm using a 
radioactive trapping volatilization assay. To verity 

35 that the volatilized counts are true 'H counts a spec- 
trum analysis was performed. As demonstrated m Fig. 
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10b no volatilization of tritiated methyl residues is 
observed in the misframe dMTase (misframe) whereas in 
vitro translated putative dMTase cDNA catalyzes the 
volatilization of 'H-CH3 residues which are trapped in 
5 the scintillation cocktail. 

Second, in vitro translated dMTase cDNA trans- 
forms CH3-cytosine residing in [''P] -a-dGTP labeled 
plasmid DNA or in [methyl -dC32pdGl n double stranded 
oligomer DNA to cytosine, whereas a frame shift in 

10 vitro translated dMTase does not demethylate DNA (Fig. 
lOd) . This demonstrates that the dMTase activity is 
dependent on the dMTase translation product and not a 
contaminating activity found in the in vitro transla- 
tion kit that copurifies with the putative dMTase. The 

15 reaction carried out by the in vitro translated dMTase 
displays: dependence on the dose of in vitro translated 
product (Fig. 10c), time dependence (Fig. lOd) and 
dependence on translated protein (Fig. 10b & d mis- 
frame. Fig. 10c protease K treatment) . Taken together, 

2 0 these results strongly suggest that the cDNA cloned 
here codes for a bona fide enzymatic DNA demethylase 
activity. 

Transiently trans fee ted dMTase cDNA demethylates DNA 

25 dMTase cDNA and the pcDNA3 . IHisC vector control 

were transiently transfected into human embryonal kid- 
ney cells to test whether the cDNA can direct expres- 
sion of dMTase activity in human cells. The His-tagged 
proteins were bound to Ni2+ agarose resin and eluted 

30 from the resin, with increasing concentrations of imida- 
zole. The expression of the transfected dMTase was 
verified by a Western blot analysis (Fig. lib) . The 
imidazole fractions were assayed for their ability to 
volatilize and release methyl residues in [^H] -CH3-DNA 

35 using a radioactive trapping volatilization assay 1. 
As observed in Fig. 11a, imidazole fractions from 
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dMTase transfected cells volatilize [^H] -CH3 whereas no 
tritiated counts are detected in DNA treated with imi- 
dazole fractions from cells transfected with a misframe 
mutation of dMTase or non transfected cells. The tran- 
5 siently expressed dMTase transforms methylated cytosine 
in DNA to cytosine residing in two different substrates 
(Figs, lie Sc lid), in a protein dependent manner (Figs, 
lie & lie) and the reaction displays substrate depend- 
ence and saturability (Fig. Hf ) • Transiently 
10 expressed dMTase was loaded on a non denaturing glyc- 
erol gradient to determine its native MW. Similar to 
dMTase purified from human cells, cloned and purified 
dMTase activity fractionated at the 160-190 KDa range 
(data not shown) . This is consistent with self asso- 
15 ciation of cloned dMTase possibly mediated by the 
coiled-coil domain. 

Cloned DNA dMTase catalyzes a hydrolysis of B-methyl- 
cytosine to release methanol 

20 we determined the mechanism by which methyl 

residues are released by the cloned dMTase (from Fig. 
11) and compared it to the purified bona fide dMTase 
activity. increasing amounts of non labeled [methyl- 
dCpdG] DNA were incubated with either the bona fade 
25 dMTase activity purified from A549 cells or the cloned 
dMTase in the presence of [^H] water for 3 hours fol- 
lowed by digestion to mononucleotides, a thin layer 
chromatography and autoradiography. As Fig. 12a shows, 
both reactions replace the methyl group in 5-methylcy- 
30 tosine with a proton donated from water as indicated by 
the presence of [^H] label in cytosine. 

The identity of the leaving methyl group in the 
demethylation reaction catalyzed by the purified bona 
fide dMTase activity was shown to be methanol. In 
35 order to identify the form that the methyl residue 
leaves as in the demethylation reaction catalyzed by 
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the cloned dMTase an identical gas chromatography/mass 
spectrometry analysis of the reaction products was per- 
formed as inl . Only the properly translated form of 
dMTase (both in vitro translated and transiently trans- 
5 fected and purified) is able to produce ions character- 
istic of methanol in a mass spectrometric analysis 
(mass of 32 and 29, Fig. 12b). These results suggest 
that the demethylation reaction catalyzed by the cloned 
dMTase is hydrolysis of the 5 -methyl -cytosine to cyto- 
10 sine and methanol as described for the purified 
dMTasel . 

DNA dMTase activity is undetectaODle in nontrans formed 
cells 

The assays for dMTase activity described here 

15 and the cloning of DNA dMTase cDNA enables a study of 
its expression at different cellular states. Global 
hypomethylation of DNA is a common observation in can- 
cer cells. This has been a perplexing observation, 
since DNA MeTase activity is elevated in cancer cells. 

2 0 Hyperactivation of DNA MeTase has been proposed to play 
a role in cancer development. This paradox raises 
questions on the proposed role of the elevated levels 
of DNA MeTase in cancer cells. One simple explanation 
that has been previously suggested to resolve this 

25 paradox is that cancer cells express induced levels of 
DNA dMTase. We compared the DNA dMTase activity in 
equal concentrations of DEAE-Sephadex fractionated 
nuclear extracts (fractions 9-10) prepared from a num- 
ber of carcinoma cell lines H446, Colo 205, Hela, and 

30 A54 9 with a similar preparation from human skin fibro- 
blast cells at initial rate conditions using 
[mdC32pdG]n double stranded oligomer as a substrate. 
As observed in Fig. 13a, whereas DNA dMTase activity is 
readily observed in all carcinoma cell lines, it is 

35 undetectable in nontransf ormed human cells. The 
absence of dMTase activity in human primary cells 
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reflects the situation in vivo since dMTase activity is 
undetectable in preparations from different murine tis- 
sues whereas dMTase activity is present in a murine 
carcinoma cell line P19 that was transfected with the 
H-Ras protooncogene, or human tumors carried as xeno- 
grafts in the same strain of mouse (Fig. la: COLO 205, 
A549. Hela) . These conclusions were verified using the 
radioactive-trapping volatilization' assay shown in Fig. 

13c. 

Since dMTase mRNA has been detected using a sen- 
sitive poly A+ Northern blot in all normal human tis- 
sues, we tested the hypothesis that the absence of 
detected dMTase activity in normal tissues reflects a 
quantitative difference in DNA dMTase mRNA between nor- 
mal tissues and cancer lines. A Northern blot analysis 
and quantification of dMTase mRNA by a slot blot analy- 
sis shown in Fig. 13d using total RNA supports thxs 
hypothesis. Whereas minute levels of dMTase mRNA are 
detected in normal tissues, high levels of dMTase are 
expressed in a murine carcinoma cell line Yl that bears 
a 30 fold amplification of Ha-ras. 

A second DNA demethylase dMTase2 identified in human 
and mouse 

cDNA sequences, predicted amino acid sequences, and 
GenBank accession numbers of both dMTasel and dMTase2 
from human and mouse are shown. We claim that the hxgh 
level of identity of the two proteins (Figs 9c and e) 
suggests that the two proteins can perform the same 
function, DNA demethylation. The N-terminals of 

dMTasel and dMTase2 contain a Methylated DNA Binding 
Domain (MBD) and near their C-terminals is a coiled- 
coil domain, however the middle portions of the protein 
sequences have no homology to any know structural or 
catalytic motif. Importantly, their middle regions are 
35 still extensively homologous suggesting that the cata- 
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lytic site of the demethylase activity lies in this 
area on both proteins . 

Induced expression of DNA demethylase in the Antisense 
orientation inhibits tiomorigenesis ex vivo 
5 To test the hypothesis that inhibition of DNA 

dMTase can inhibit tumorigenesis tetracycline inducible 
vectors carrying the human dMTasel cDNA in either the 
sense or antisense orientation were constructed and 
transiently transfected into HEK 293 cells, treated for 

10 4 8 hours either in the presence or absence of doxycy- 
cline (a tetracycline analogue) , selected for the last 
24 hours with puromycin, and then plated on soft agar 
and allowed to grow for seven days. After seven days 
colonies were scored and the data presented clearly 

15 show that doxycycline induced expression of the dMTasel 
cDNA in the antisense orientation reduced colony forma- 
tion (Fig. 15) . 

Imidazole is a small molecule inhibitor of DNA 
demethylase activity 

20 A template small molecule, imidazole, was tested 

for the ability to inhibit DNA dMTase activity. In a 
volatilization of radioactive methyl residues assay, 
concentrations from 1/zM to lOmM of imidazole were incu- 
bated in a typical volatilization of radioactive methyl 

25 residues as described above. The graph clearly demon- 
strates a dose dependent inhibition of DNA dMTase 
activity by imidazole, and validates a rationale for 
testing imidazole based molecules as inhibitors of DNA 
dMTase activity (Fig. 16) . 

30 Identification of DNA demethylase cDNAs and protein 
sec[uences 

Fig. 9a illustrates cDNA sequence of human dMTasel (SEQ 
ID NO:l) and its predicted amino acid sequence (SEQ ID 
NO:2), including its Genbank location. Fig. 9b illus- 
35 trates cDNA sequence of human dMTase2 (SEQ ID NO: 3) and 
its predicted amino acid sequence (SEQ ID NO:4) , includ- 
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ing its GenBank location. Fig- 9c illustrates protexn 
sequence alignment of human dMTasel and human dMTase2 . 
Fig. 9d illustrates cDNA sequence of mouse dMTasel (SEQ 
ID NO: 5) and its predicted amino acid sequence (SEQ ID 
5 N0:6), including its GenBank location. Fig. 9e illus- 
trates CDNA sequence of mouse dMTase2 (SEQ ID NO: 7) and 
its predicted amino acid sequence (SEQ ID N0:8), 
including its GenBank location. Fig. 9f illustrates 
protein sequence alignment of mouse dMTasel and mouse 

10 dMTase2 . 

While the invention has been described in con- 
nection with specific embodiments thereof, it will be 
understood that it is capable of further modifications 
and this application is intended to cover any varia- 
15 tions, uses, or adaptations of the invention following, 
in general, the principles of the invention and 
including such departures from the present disclosure 
as come within known or customary practice within the 
art to which the invention pertains and as may be 
20 applied to the essential features hereinbefore set 
forth, and as follows in the scope of the appended 
claims . 
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WHAT IS CLAIMED IS ; 

1 . A DNA demethylase enzyme and/or homologue 
thereof having about 4 0 KDa, and wherein said DNA 
demethylase enzyme is overexpressed in cancer cells. 

2 . A cDNA encoding a human demethylase which com- 
prises a sequence set forth in SEQ TD NOS:l and 3. 

3 . A cDNA homologous to the cDNA of claim 2 , 
wherein said cDNA encoding mouse demethylase set forth 
in SEQ ID NOS : 5 and 7. 

4. The use of the expression of demethylase cDNA of 
claims 2 or 3 to alter DNA methylation patterns of DNA 
in vitro in cells or in vivo in humans, animals and in 
plants . 

5. The use of claim 4, wherein said demethylase 
CDNA expression is under the direction of mammalian 
promoters . 

6. The use of claim 5, wherein said promoter is 
CMV. 

7. The use of claim 4, wherein said demethylase 
cDNA expression is under plant specific promoters to 
alter methylation in plants and to allow for altering 
states of development of plants and expression of for- 
eign genes in- plants. 

8. The use of claim 4, wherein said demethylase 
cDNA expression is in the antisense orientation to 
inhibit demethylase in cancer cells for therapeutic 
processes . 
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9. The use of claim 9, wherein expression of 
demethylase cDNA in mammalian cells is to alter their 
differentiation state and to generate stem cells for 
therapeutics, cells for animal cloning and to improve 
expression of foreign genes. 

10. The use of the expression of demethylase cDNA of 
claims 2 or 3 in bacterial or insect cells for produc- 
tion of large amounts of demethylase. 

11. The use of the expression of demethylase cDNA of 
claims 2 or 3 for the production of protein in verte- 
brate, insect or bacterial cells. 

12. The use of claim 11 for producing antibodies 
against demethylase . 

13. The use of the sequence of demethylase cDNA of 
claim 2 as a template to design antisense oligonucleo- 
tides and ribozymes. 

14. The use of the predicted peptide sequence of 
demethylase cDNA of claim 2 to produce polyclonal or 
monoclonal antibodies against demethylase. 

15. The use of expression of cDNA of claim 2 or 3 in 
two hybrid systems in yeast to identify proteins inter- 
acting with demethylase for diagnostic and therapeutic 
purposes . 

16. The use of expression of cDNA of claim 2 or 3 in 
bacterial, vertebrate or insect cells to produce large 
amounts of demethylase for high throughput screening of 
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demethylase inhibitors for therapeutics and biotechnol- 
ogy and for obtaining the x-ray crystal structure. 

17. A volatile assay for high throughput screening 
of demethylase inhibitors as therapeutics and antican- 
cer agents which comprises the steps of : 

a) using transcribed and translated demethylase 
cDNA of claim 2 or 3 in vitro to convert methyl - 
cytosine present in methylated DNA samples to 
cytosine present in DNA and volatilize methyl 
group ; 

b) determining the absence or minute amount of 
volatilize methyl group as an indication of an 
active demethylase inhibitor. 

18. A volatile assay for the diagnostics of cancer 
in a patient sample which comprises the steps of: 

a) determining demethylase activity in patient sam- 
ples by determining conversion of methyl -cyto- 
sine present in methylated DNA to cytosine pres- 
ent in DNA and volatilization of the methyl 
group released as methanol; 

b) determining the presence or minute amount of 
volatilized methyl group as an indication of 
cancer in said patient sample. 

19. Use of an antagonist or inhibitor of DNA demeth- 
ylase of claim 1 or 2 for the manufacture of a medica- 
ment for cancer treatment, for restoring an aberrant 
methylation pattern in a patient DNA, or for changing a 
methylation pattern in a patient DNA. 

20. Use according to claim 19, wherein said antago- 
nist is a double stranded oligonucleotide that inhibits 
demethylase at a Ki of 50nM. 
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21. use according to claim 20, wherein said oligonu- 

cleotide is fc^GCWGC-'Gl . 

iG^CG^CG^CG^cJ n 



22 



Use according to claim 19, wherein the inhibitor 
comprises an anti-DNA demethylase antibody or an 
oligonucleotide of DNA demethylase or a small 



ant 1 sense 
molecule. 

23. use according to one of claims 19 or 22, wherein 
the change of the methylation pattern activates a 
silent gene. 

24. use according to claim 23, wherein the activa- 
tion of a silent gene permits the correction of genetic 
defect . 

25. use according to claim 24, wherein said genetic 
defect is p-thalassemia or sickle cell anemia. 

26. use of the demethylase of claim 1, for removing 
methyl groups on DNA in vitro. 



27 



Use of the demethylase of claim 1 or its cDNA of 
claim 2, for changing the state of differentiation of a 
cell to allow gene therapy, stem cell selection or cell 
cloning . 

28. use of. the demethylase of claim 1 or its cDNA, 

of claim 2 for inhibiting methylation in cancer cells 
using vector mediated gene therapy. 



29. An assay for the diagnostic of cancer 

patient, which comprises determining the level of 
expression of DNA demethylase of claim 1 in a sample 
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from said patient, wherein overexpression of said DNA 
demethylase is indicative of cancer cells. 
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SEQUENCE LISTING 

<110> McGILL UNIVERSITY 
SZYF, Moshe 

BHATTACHARYA , San joy K. 
RAMCHANDANI , Shyam 



<120> DNA DEMETHYLASE, THERAPEUTIC AND 
DIAGNOSTIC USES THEREOF 

<130> 1770-183"PCT" FC/ld 

<150> CA 2,220,805 
<151> 1997-11-12 

<150> CA 2,230,991 
<151> 1998-05-11 

<160> 10 

<170> FastSEQ for Windows Version 3.0 

<210> 1 
<211> 1804 
<212> DNA 
<213> Unknown 

„.«c::c°%^c3.,,c,. .tc.ccg.a „c„ .C3.«.c,, aae^ JO 
gagccggctg gggagggggc tggatgcgcg cgcacccggg tccgccatag 180 

IgLggagga gggggagagc gcggcgggcg 9cagcggcgc tggcggcgac tccg | 
agcagggggg ccagggcagc gcgctcgctc ^f l^lllf ggcggcggcg 300 

gcgctcgggg cggcggccgt ggccgggggc ggtggaagca 99^99 99 
?c?gtggccg tggccgtggc cgtggccggg 9^9999^^9 ^99^^99^gc ^9999 9gg 
gccgcggccg tccccagagt Sgcggcagcg 9Ccttggcgg C9acggc99c 99^99^9^93 
gcggctgcgg cgtcggcagc ggtggcggcg tcgccccccg S^ggy qggaagagga 54 0 

cg?cggggag ctcggggccg gggcccaggg gaccccgggc "=9gagagc 999 ^ 
t|gal?gccc ggccctcccc cccggatgga agaaggagga jg^g^^ccga ^aatcaggg 
tclgtgctgg caagagcgat gtctactact tcagtccaag tggtaagaag ^tcag^^g 
aacctcagct ggcaagatac ctgggaaatg <==^gttgacct ^agcagc J ^^^^^ 
ccggcaagat gatgcctagt aaattacaga agaacaagca 9agactccgg ^^^9^ 
tcaatcagaa caagggtaaa ccagacctga acacaacact 9 900 

caattttcaa gcaaccagta accaaattca cgaaccaccc sagcaataag 9tg^^9 9 

acccccagcg gatgaatgaa "^^"^9^^ Saaaacca? ISlctacc? aaaggtcttc 1020 
ttagcgcatc agatgtaaca gaacaaatta ^aaaaaccat 99 9 ^gctttac 1080 

aaggagtcgg tccaggtagc aatgacgaga ^""^^9tc ^gctgtggcc 9^9^ 
acacaagctc tgcgcccatc acaggacaag tctctgctgc ^9tgga 9 ^300 
tttggcttaa cacatctcaa cccctctgca aagctttcat tgttacagat g^J9 
ggallcagga agagcgagtc caacaagtac gcaagaaact 9gaggaggca ^^gatgg g ^^^^ 
Icatcctgtc ccgggctgcg gacacggagg aagtagacat ^9acatggac ^9^99 9 9 ^^^^ 
aggcgtaaga atatgatcag gtaactttcg ^ctgaccttc ^^=^^9^9-^ ^ ^^^^ 

aJLgaatta aaacatttcc actgggt.tc 9cctgtaaga aaaagtgtac c^g^g^^^^^ ^^^^ 
agctttttaa tagcactaac caatgccttt ttagatgtac ^"^9 y taataacaaq 
Iltccaaatg atgtttattt tgaatcctag 9acttaaaat 9agtctttta ^aatagcaag 
cagggccctt ccggtgcagt gcagctttga 9gccaggtgc agtctactgg 99^ 
cttacgtgaa atatttgttt cccccacagt tttaatataa acag yy 



1560 
1620 
1680 
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aagtttccca attaaagatt attatacttc actgtatata aacagatttt tatactttat 174 0 
tgaaagaaga tacctgtaca ttcttccatc atcactgtaa agacaaataa atgactatat 1800 
tcac 1804 



<210> 2 

<211> 411 

<212> PRT 

<213> Unknown 



<400> 2 



Met 


Arg 


Ala 


His 


1 

Gly 


Glu 


Ser 


Ala 








20 


Glu 


Gin 


Gly 


Gly 






35 




Val 


Arg 


Arg 


Glu 




50 






Lys 


Gin 


Ala 


Gly 


65 








Gly 


Arg 


Gly 


Arg 


Pro 


Pro 


Ser 


Gly 








100 


Gly 


Gly Gly 


Ser 






115 




Phe 


Pro 


Ser 


Gly 




130 






Glu 


Ser 


Gly 


Lys 


145 








Lys 


Glu 


Glu 


Val 


Val 


Tyr 


Tyr 


Phe 








180 


Leu 


Ala 


Arg 


Tyr 






195 




Arq 


Thr 


Gly 


Lys 




210 






Leu 


Arg 


Asn 


Asp 


225 








Thr 


Thr 


Leu 


Pro 


Thr 


Lys 


Val 


Thr 








260 


Arg 


Met 


Asn 


Glu 






275 




Gly 


Leu 


Ser 


Ala 




290 






Leu 


Pro 


Lys 


Gly 


305 








Leu 


Leu 


Ser 


Ala 


Thr 


Gly Gin 


Val 








340 




Thr 


Ser 


-Gin 






355 





Pro 


Gly 


Gly 


Gly 


5 








Ala 


Gly 


Gly 


Ser 


Gin 


Gly 


Ser 


Ala 








40 


Gly 


Ala 


Arg 


Gly 






55 




Arg 


Gly 


Gly 


Gly 




70 






Gly 


Arg 


Gly 


Arg 


85 








Gly 


Ser 


Gly 


Leu 


Gly 


Gly 


Gly 


Gly 








120 


Ser 


Ala 


Gly 


Pro 






135 




Arg 


Met 


Asp 


Cys 




150 






He 


Arg 


Lys 


Ser 


165 








Ser 


Pro 


Ser 


Gly 


Leu 


Gly 


Asn 


Thr 








200 


Met 


Met 


Pro 


Ser 






215 




Pro 


Leu 


Asn 


Gin 




230 






He 


Arg 


Gin 


Thr 


245 








Asn 


His 


Pro 


Ser 


Gin 


Pro 


Arg 


Gin 








280 


Ser 


Asp 


Val 


Thr 






295 




Leu 


Gin 


Gly 


Val 




310 






Val 


Ala 


Ser 


Ala 


325 








Ser 


Ala 


Ala 


Val 


Pro 


Leu 


Cys 


Lys 



360 



Arg Cys Cys Pro 
10 

Gly Ala Gly Gly 
25 

Leu Ala Pro Ser 

Gly Gly Arg Gly 
60 

Val Cys Gly Arg 
75 

Gly Arg Gly Arg 
90 

Gly Gly Asp Gly 
105 

Ala Pro Arg Arg 

Gly Pro Arg Gly 
140 

Pro Ala Leu Pro 
155 

Gly Leu Ser Ala 
170 

Lys Lys Phe Arg 
185 

Val Asp Leu Ser 

Lys Leu Gin Lys 
220 

Asn Lys Gly Lys 
235 

Ala Ser He Phe 
250 

Asn Lys Val Lys 
265 

Leu Phe Trp Glu 

Glu Gin He He 
300 

Gly Pro Gly Ser 
315 

Leu His Thr Ser 
330 

Glu Lys Asn Pro 
345 

Ala Phe He Val 



Glu Gin Glu Glu 
15 

Asp Ser Ala He 
30 

Pro Val Ser Gly 
45 

Arg Gly Arg Trp 

Gly Arg Gly Arg 
80 

Gly Arg Gly Arg 
95 

Gly Gly Cys Gly 
110 

Glu Pro Val Pro 
125 

Pro Arg Ala Thr 

Pro Gly Trp Lys 
160 

Gly Lys Ser Asp 
175 

Ser Lys Pro Gin 
190 

Ser Phe Asp Phe 
205 

Asn Lys Gin Arg 

Pro Asp Leu Asn 
240 

Lys Gin Pro Val 
255 

Ser Asp Pro Gin 
270 

Lys Arg Leu Gin 

285 

Lys Thr Met Glu 

Asn Asp Glu Thr 
320 

Ser Ala Pro He 
335 

Ala Val Trp Leu 
350 

Thr Asp Glu Asp 
365 
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lie Arg Lys Gin Glu Glu Arg Val Gin Gin Val Arg Lys Lys Leu Glu 



375 

Olu Vll Leu Met Ala Asp He Leu Ser Arg Ala Ala Asp Thr Glu Glu 



Met ASP He Glu Met Asp Ser Gly Asp Glu Ala 



390 



395 



405 

<210> 3 
<211> 1589 
<212> DNA 
<213> Unknown 



410 



^is^ ™s Sill gas EiS i 

s—r/c ^ ™ r/— ^3:: 

ggggcgcaat ggagcggaag ^ga^SSgagt ^^^^f I ggatgtcttt tactatagcc 360 

aagaagtgcc caggaggtcg gggctgtcgg ^^S^^^^S ??acctgggc ggatccatgg 420 
ccagcgggaa gaagttccgc agcaagccac ^actggcacg "acctggg 99^ 

accccagcac cttcgacttc cgcaccggaa ^gatgttgat SJ^^^^a ^ ctgaacaccg 540 
gccagcgtgt gcgctatgat tcttccaacc aggt^aggg -agcctgac 

cgctgcctgt acggcagact gcatccatct tcaagcaacc 99^9^^ ^ ggo 

accccagcaa caaggtcaag agcgacccgc agaaggcagt gg^^^^|^^9 cSgtcagga 720 
tctgggagaa gaagctaagt ggattgagtg cctttgacat ^gcagaagaa ^^gg 99 

^ ^ 5 s y »4:ci 
^-is^c K -™ -r.sri3 

aagagccgga gccagagcga gtgtagcaca 99^9^ ^ aoaaaacaqc cgtccacctc 

gccttcagcc ttgcctggac caggtagggg ccagacctgt ^9g;99cagc ^9 ^^^^ 

?tttcca!ag cctcctgctt ccaggtctca 9tgcagggag ^-^^^gtgga ccttga ^^^^ 

acttgtccct gcgctgcctg gcaggaagcc ccacactgaa agcagatgag ^ag J^^^ ,33^ 

actgagaggc cacctggaca cagtcacctc ^^tgcctcct "tcatagg ^ ^^^q 

cttggcaccg aggagctggg agccgtgttg ggtgctggag 9aagtttctg 9^ ^^qq 

tggcLtgcc caccttatgt ccctaaggct attacaggcc -ggg^^^99a ^^9^^^^99^ ^^^^ 

ccacagggct gcccagcctc cccacactga gggtcagcag cccaccagga age ^^^^ 
cttcaataaa ctgatggtag gaacttgtg 



<210> 4 
<211> 291 
<212> PRT 
<213> Unknovm 



Met Gl^ Ar^ .ys Arg Trp Glu Cys Pro Ala I.eu Pro Gin Gly Trp Glu 
A^g Glu Glu val Pro Arg Arg Ser Gly Leu Ser Ala Gly His Arg Asp 
val Phe Tyr Ser Pro Ser Gly Lys Lys Phe Arg Ser Lys Pro Gin 

Leu Ala Arg Tyr Leu Gly Gly Ser Met Asp Leu Ser Thr Phe Asp Phe 

55 

Arg Thr Gly Lys Met Leu Met Ser Lys Met Asn Lys Ser Arg Gin Arg 



65 



70 



840 
900 
960 
1020 
1080 
1140 
1200 
1260 
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Val 


Arg 


Tyr 


Asp 


Ser 


Ser 


Asn 


Gin 


Val 


Lys 


Gly Lys 


Pro 


Asp 


Leu 


Asn 






85 










90 










95 




Thr 


Ala 


Leu 


Pro 


Val 


Arg 


Gin 


Thr 


Ala 


Ser 


He 


Phe 


Lys 


Gin 


Pro 


Val 








100 










105 










110 






Thr 


Lys 


He 


Thr 


Asn 


His 


Pro 


Ser 


Asn 


Lys 


Val 


Lys 


Ser 


Asp 


Pro 


Gin 




115 










120 










125 








Lvs 


Ala 


Val 


Asp 


Gin 


Pro 


Arg 


Gin 


Leu 


Phe 


Trp 


Glu 


Lys 


Lys 


Leu 


Ser 


130 










135 










14 0 










Gly 


Leu 


Asn 


Ala 


Phe 


Asp 


He 


Ala 


Glu 


Glu 


Leu 


Val 


Lys 


Thr 


Met 


Asp 


145 










150 










155 










160 


Leu 


Pro 


Lys Gly 


Leu 


Gin 


Gly 


Val 


Gly 


Pro Gly Cys 


Thr 


Asp 


Glu 


Thr 










165 










170 










175 




Leu 


Leu 


Ser 


Ala 


He 


Ala 


Ser 


Ala 


Leu 


His 


Thr 


Ser 


Thr 


Met 


Pro 


He 








180 










185 










190 






Thr 


Gly 


Gin 


Leu 


Ser 


Ala 


Ala 


Val 


Glu 


Lys 


Asn 


Pro 


Gly 


Val 


Trp 


Leu 




195 










200 










205 








Asn 


Thr 


Thr 


Gin 


Pro 


Leu 


Cys 


Lys 


Ala 


Phe 


Met 


Val 


Thr 


Asp 


Glu 


Asp 




210 










215 










220 










He 


Arg 


Lys 


Gin 


Glu 


Glu 


Leu 


Val 


Gin 


Gin 


Val 


Arg 


Lys 


Arg 


Leu 


Glu 


225 






230 










235 










240 


Glu 


Ala 


Leu 


Met 


Ala 


Asp 


Met 


Leu 


Ala 


His 


Val 


Glu 


Glu 


Leu 


Ala 


Arg 










245 










250 










255 




Asp 


Gly 


Glu 


Ala 


Pro 


Leu 


Asp 


Lys 


Ala 


Cys 


Ala 


Glu 


Asp 


Asp 


Asp 


Glu 




260 










265 










270 






Glu 


Asp 


Glu 


Glu 


Glu 


Glu 


Glu 


Glu 


Glu 


Pro 


Asp 


Pro 


Asp 


Pro 


Glu 


Met 




275 










280 










285 









Glu His Val 
290 



<210> 5 
<211> 1966 
<212> DNA 
<213> Unknown 



60 



<400> 5 

gggggcgtgg ccccgagaag gcggagacaa gatggccgcc catagcgctt ggaggaccta 

agaggcggtg gccggggcca cgccccgggc aggagggccg ctctgtgcgc gcccgctcta 12 0 

tgatgcttgc gcgcgtcccc cgcgcgccgc gctgcgggcg gggcgggtct ccgggattcc 180 

aagggctcgg ttacggaaga agcgcagcgc cggctgggga gggggctgga tgcgcgcgca 240 

cccgggggga ggccgctgct gcccggagca ggaggagggg gagagtgcgg cgggcggcag 300 

cggcgctggc ggcgactccg ccatagagca ggggggccag ggcagcgcgc tcgccccgtc 360 

cccggtgagc ggcgtgcgca gggaaggcgc tcggggcggc ggccgtggcc gggggcggtg 420 

gaagcaggcg ggccggggcg gcggcgtctg tggccgtggc cggggccggg gccgtggccg 4 80 

gggacgggga cggggccggg gccggggccg cggccgtccc ccgagtggcg gcagcggcct 54 0 

tggcggcgac ggcggcggct gcggcggcgg cggcagcggt ggcggcggcg ccccccggcg 600 

ggagccggtc cctttcccgt cggggagcgc ggggccgggg cccaggggac cccgggccac 660 

ggagagcggg aagaggatgg attgcccggc cctccccccc ggatggaaga aggaggaagt 72 0 

gatccgaaaa tctgggctaa gtgctggcaa gagcgatgtc tactacttca gtccaagtgg 780 

taagaagttc agaagcaagc ctcagttggc aaggtacctg ggaaatactg ttgatctcag 840 

cagttttgac ttcagaactg gaaagatgat gcctagtaaa ttacagaaga acaaacagag 900 

actgcgaaac gatcctctca atcaaaataa gggtaaacca gacttgaata caacattgcc 960 

aattagacaa acagcatcaa ttttcaaaca accggtaacc aaagtcacaa atcatcctag 1020 

taataaagtg aaatcagacc cacaacgaat gaatgaacag ccacgtcagc ttttctggga 10 80 

gaagaggcta caaggactta gtgcatcaga tgtaacagaa caaattataa aaaccatgga 1140 

actacccaaa ggtcttcaag gagttggtcc aggtagcaat gatgagaccc ttttatctgc 1200 

tgttgccagt gctttgcaca caagctctgc gccaatcaca gggcaagtct ccgctgctgt 1260 

ggaaaagaac cctgctgttt ggcttaacac atctcaaccc ctctgcaaag cttttattgt 1320 
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<210> 6 

<211> 414 

<212> PRT 

<213> Unknown 



1380 
1440 
1500 
1560 



.aca,a..aa .,c,tca.3. sc,™ c»|e.c,c. 

agaagcactg atggcagaca tcttgtcgcg ^g«=^9"y ;,ctttcaacc gactttcccc 
altglacagt ggagatgaag cctaagaata tgatcaggta -tttcgacc 9 ^ 
aagrgaaaat tcctagaaat tgaacaaaaa tgtttccact 9gctt g^^ ^l^^^.tgta 1620 
aaaatgtacc cgagcacata gagcttttta atagcac = ^cc taggacttaa 1680 

tttttgatgt atatatctat tattcaaaaa ^tcatgttta "ttgagtcc 99 ^.^^ 
aattagtctt ttgtaatatc ^agcaggacc ^^^^9atgaa 9-^9^9 ^.^^taata 1800 

tgcaatctac tggaaatgta 9cacttacgt aaaacatttg ^ cttcactgta I860 

llllllllll reaJ^rafag gggLaccJ „.tc ca.ca.cac. 

gtaaagacaa ataaatgatt atattcacaa aaaaaaaaaa aaaaaa 



1920 
1966 



Met ..Till His pro Oly Oly Oly Arg Cys Cy. Pro Glu Oln Olu Olu 
oly Glu Ser Ala Ala Gly Gly Ser Gly Ala Gly Gly Asp Ser Ala lie 
Clu Gin Gly G^y Gin Gly Ser Ala Leu Ala Pro Ser Pro Val Ser Gly 
Val Arg ^g Glu Gly Ala Arg G^ly Gly Gly Arg Gly Arg Gly Arg Trp 
^ys gL Ala Ala Arg Gly Gly Gly Val Cys Gly Arg Gly Arg Gly Arg 
Oly Arg Gly Arg Gly ^g Gly Arg Gly Arg Gly Arg Gly Arg Gly Arg 
P.O Gin ser Gly Gly Ser Gly Leu Gly Gly Asp Gly Gly Gly Gly Ala 
Gly Gly cys Z Val Gly Ser Gly "ly Gly Val Ala Pro Arg Arg Asp 
P.O val Pro Phe Pro Ser Gly Ser Ser Gly Pro Gly Pro Arg Gly Pro 
Arg III Thr Glu Ser Gly ^s Arg Met Asp Cys Pro Ala Leu Pro Pro 

-ICQ lb b 

^ ,T 1 2v^^ Two Qer Glv Leu Ser Ala Gly 

Gly Trp Lys Lys Glu Glu Val He Arg Lys Ser Giy u 

T 170 

X^ys ser ASP val Tyr Phe Ser Pro Ser Gly Lys Lys Phe Arg Ser 

^ys pro Gin Leu Ala Arg Tyr Leu Gly Asn Ala Val Asp Leu Ser Ser 
Phe ASP Phe Arg Thr Gly Lys III Met Pro Ser Lys Leu Gin Lys Asn 
.ys Gin Arg Leu Arg Asn Jsp Pro Leu Asn Gin Asn Lys Gly Lys Pro 
fsp Leu Asn Thr Thr Pro Xle Arg Gin Thr Ala Ser He Phe Lys 

Oln pro val Thr L^s Phe Thr Asn His Pro Ser Asn Lys Val Lys Ser 
;,sp pro Gin irg Met Asn Glu Gin Pro Arg Gin Leu Phe Trp Glu Lys 
^g Leu Gin Gly Leu Ser Ala III Asp Val Thr Glu Gin Xle He Lys 
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Thr 


Met 


Glu 


Leu 


Pro 


Lys 


Gly 


Leu 


Gin Gly 


Val 


Gly 


Pro Gly 


Ser 


Asn 


305 










310 










315 










320 


Asp 


Glu 


Thr 


Leu 


Leu 


Ser 


Ala 


val 


Ala 


Ser 


Ala 


Leu 


His 


Thr 


Ser 


Ser 








325 










330 










335 




Ala 


Pro 


He 


Thr 


Gly 


Gin 


Val 


Ser 


Ala 


Ala 


Val 


Glu 


Lys 


Asn 


Pro 


Ala 








340 










345 










350 






Val 


Trp 


Leu 


Asn 


Thr 


Ser 


Gin 


Pro 


Leu 


Cys 


Lys 


Ala 


Phe 


He 


Val 


Thr 




355 










360 










365 








Asp 


Glu 


Asp 


He 


Arg 


Lys 


Gin 


Glu 


Glu 


Arg 


Val 


Gin 


Gin 


Val 


Arg 


Lys 


370 










375 










380 






Ala 




Lys 


Leu 


Glu 


Glu 


Ala 


Leu 


Met 


Ala 


Asp 


He 


Leu 


Ser 


Arg 


Ala 


Asp 


385 










390 










395 






Ala 




400 


Thr 


Glu 


Glu 


Val 


Asp 


He 


Asp 


Met 


Asp 


Ser 


Gly Asp 


Glu 














405 










410 















<2ao> 7 

<211> 2392 
<212> DNA 
<213> Unknown 

<400> 7 

agcgggccga ggagccgggc gcaatggagc ggaagaggtg ggagtgcccg gcgctcccgc 
agggctggga gagggaagaa gtgcccagaa ggtcggggct gtcggccggc cacagggatg 
tcttttacta tagcccgagc gggaagaagt tccgcagcaa gccgcagctg gcgcgctacc 
tgggcggctc catggacctg agcaccttcg acttccgcac gggcaagatg ctgatgagca 
agatgaacaa gagccgccag cgcgtgcgct acgactcctc caaccaggtc aagggcaagc 
ccgacctgaa cacggcgctg cccgtgcgcc agacggcgtc catcttcaag cagccggtga 
ccaagattac caaccacccc agcaacaagg tcaagagcga cccgcagaag gcggtggacc 
agccgcgcca gctcttctgg gagaagaagc tgagcggcct gaacgccttc gacattgctg 
aggagctggt caagaccatg gacctcccca agggcctgca gggggtggga cctggctgca 
cggatgagac gctgctgtcg gccatcgcca gcgccctgca cactagcacc atgcccatca 
cgggacagct ctcggccgcc gtggagaaga accccggcgt atggctcaac accacgcagc 
ccctgtgcaa agccttcatg gtgaccgacg aggacatcag gaagcaggaa gagctggtgc 
agcaggtgcg gaagcggctg gaggaggcgc tgatggccga catgctggcg cacgtggagg 
agctggcccg tgacggggag gcgccgctgg acaaggcctg cgctgaggac gacgacgagg 
aagacgagga ggaggaggag gaggagcccg acccggaccc ggagatggag cacgtctagg 
gcagaggccc tgccgagagc ccgtgctgcc tgctggagcc gcctgcagac gcggtcctcg 
gccccacgtg aaccaggctc ggcggcgaag cccagccttg gagacaccca ggaggaaggc 
cgtgctcctg gctccctcct cggcccgtcc ccacttcccg gggcctcggg gcacacagct 1080 
ggggctgccc ccacccgaaa gaccctccac gctcgtcctc tacagagtcc ggcttcggga 1140 
agtgccgggt gctcctgggc cctgcctggc tccctacgac ctttgggctc gaggccagct 
cctccccatg cccgctgtcc cagctccttg agactggaga gcagccagca ggtgcccggc 
agctcggcgc cacggcttgc tgacagctgg gagggtttct cggtctggag gcgtagtttt 1320 
gaaactcaca tcacccactg tgcagcgtga ggacgggact ctggtctgct gtggggggca 13 80 
tgcaggacgg cgccactctc tgccctgcca tgcggctggt ggtgccacag agcctcaccg 
tgcctgagtg gcgtgcccag ggaggccgct ctccttcagt aaatgtaaca cagtcgaggc 
acgtcatcgg gcagccttcc ctgtgtgcca acgccagcct tcgcttctga aaaccaaact 1560 
ccagccgctg ccagtc^gga cttggtcgcc cggcgctgcc agaatgctcc actgccagcc 1620 
ggcccccctg cctcggtttc ccttctgttt agtggcgaca caggcaccca gctttggggt 1680 
ggtgctgacg ctcccagggg tgccaggagc cactgggaca gggtgaggct cccagacgct 1740 
cctcgaggtg cccagctctc cagggagctt ctggcccaag gcgttcttga gggatctgct 1800 
ccttaacccc ccagtgcctt ggcgagggca ggttccaagc cacagacgcc tgccccgagt 1860 
ggactttgcg gccagtccct gggtgccttc ctgggccctg cttgcccagt gagggttcct 192 0 
aacgggtggg ttcawtggcc tggcccvagc gagcccccac ctgcattgac cttaggccca 1980 
tagagagggc ctgtcccggt gctgccccag ccaaggatct ggtcgctgcc ccagggggac 
tgatgggcaa gagtcgcccc tgtggctgga ctgtgaccat ccctgatggg gcctgaccgc 
gggagctgag gaagcgccgc tccaccgtct gccctccaag gacccgcatg gaggcagtgg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 



1200 
1260 



1440 
1500 



2040 
2100 
2160 
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gctggcagct tcctgctgct ccctgtcaga gtcaaagcac aaatcctcag 9ac9ggctca 2220 
^ggfccalgg cagccgaggg aagctccagg t99e9-cac ?|gaSgIgc 2340 

cgSlSI SraSSc ^f™. |ga4.sc.. gg 



<210> 8 
<211> 285 
<212> PRT 
<213> Unknown 



air«^ °V= Trp Olu cys P« .1. pro Oln .ly Trp Olu 
3X„ Glu val p'o »g s« Gly I-u Ser Al. Oly His Arg «P 
val Ph. Tyr s.r Pro s,r aly Ly= Lys Ph. Arg Ser I.y. Pro Oln 

:.eu Ala Arg Tyr I,.u Gly Oly «r M« Asp L.u S.r Thr Ph. Asp Ph. 
Arg ^r Oly I.y= M.t .... H« As„ l.ys Ket As» I-y. s.r Arg 01„ Arg 
Jal Arg Tyr Asp S.r As„ 01„ val Lys Oly Lys Pro Asp L.U As„ 
Thr Ala L.. Pro "l Arg 01„ Thr Al. s.r 11. Ph. Lys Oln Pro Val 
Thr Lys II. Thr As„ His Pro S.r As„ Lys val Lys S.r Asp Pro Gl. 
Lys Ala i"l ASP oln Pro Arg "in L.u Ph. Trp Olu Lys Lys L.U s.r 
Oly ser Ala Ph. Asp "l^ Ala olu Olu L.u Val Arg Thr „.t Asp 

r.u pro Lys Oly Leu o" Gly val Gly Pro Oly Cys Thr Asp Glu Thr 
.eu L.U S.r Al. 5" Ala S.r Ala L.u His Thr S.r Thr L.u Pro II. 
.Hr Oly Gl„ l'S S.r Ala Al. v.l Glu Lys As„ Pro Gly val Trp L.u 
Asu Thr 01„ pro L,u cys "s Al. Ph. Hat v.l Thr Asp Asp Asp 

11. «g Ly. Gin Olu Glu L^u Val oln 01„ V.l Arg Lys Arg Leu Olu 
Jlu Al. L.U «.t Al. ASP H,t Leu Al. His V.l Olu Olu Leu Al. Arg 
ASP Gly Olu Al. pro Leu Asp Lys Al. Cys Al. olu Glu Glu Olu Glu 
Glu Olu Glu Glu Glu Olu Glu Pro Glu Pro Olu Arg Val 

^ 280 ^ 



<210> 9 
<211> 17 
<212> DNA 
<213> Unknown 

<400> 9 17 
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